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NUCLEOTIDE SEQUENCES OF 
SOYBEAN ACYL-ACP THIOESTERASE GENES 
t 5 CRQgS-MFERB WCE TO RELATED APPLICATION 

This application is a continuation-in-part of 
application U.S. Serial No. 07/631,264, filed 

December 20, 1990. 

ETET.D OF THE INVENTION 

10 The invention relates to isolated nucleic acid 

fragments that encode plant seed acyl-ACP thioesterase 
enzymes or its _ precursor . Such fragments are useful in 
a method to alter plant oil composition. 

BACKGROUND OF THE INVENTION 

15 Soybean is the lowest-cost source of vegetable oil. 

Soybean oil accounts for about 70% of the 14 billion 
pounds of edible oil consumed in the United States and 
is a major edible oil worldwide. It is used in baking, 
frying, salad dressing, margarine, and a multitude of 

20 processed foods. Soybean is agronomically well-adapted 
to many parts of the U.S. In 1987/88 sixty million 
acres of soybean were planted in the U.S. Soybean 
products are also a major element of foreign trade since 
thirty million metric tons of soybeans, twenty-five 

25 million metric tons of soybean meal, and one billion 
pounds of soybean oil were exported in 1987/88. 
Nevertheless, increased foreign competition has lead to 
recent declines in soybean acreage and production in the 
U.S. The low cost and ready availability of soybean oil 

30 provides an excellent opportunity to upgrade this 

commodity oil into higher value speciality oils that add 
value to soybean crop for the U.S. farmer and enhance 
U.S. trade. 

The specific performance and health attributes of 
. 35 edible oils are determined largely by their fatty acid 
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composition. Soybean oil derived from commercial 
varieties is composed primarily of 11% palmitic (16:0) , 
4% stearic (18:0) , 24% oleic (18:1) , 54% linoleic (18:2) 
and 7% linolenic (18:3) acids. Palmitic and st aric 
5 acids are, respectively , 16- and 18-carbon-long, 

saturated fatty acids* Oleic, linoleic and linolenic 
are 18-carbon-long, unsaturated fatty acids containing 
one f two and three double bonds, respectively. Oleic 
acid is also referred to as a "monounsaturated" fatty 
10 acid, while linoleic and linolenic acids are also 
referred to as "polyunsaturated" fatty acids. The 
specific performance and health attributes of edible 
oils is determined largely by their fatty acid 
composition . 

15 Soybean oil is high in saturated fatty acids when 

compared to other sources of vegetable oil and contains 
a low proportion of oleic acid relative to the total 
fatty acid content of the soybean seed. These 
characteristics do not meet important health needs as 

20 defined by the American Heart Association. 

Recent research efforts have examined the role that 
monounsaturated fatty acid plays in reducing the risk of 
coronary heart disease. In the past, it was believed 
that monounsaturates, in contrast to saturates and 

25 polyunsaturates, had no effect on serum cholesterol and 
coronary heart disease risk. Several recent human 
clinical studies suggest that diets high in 
roonounsaturated fat may reduce the "bad" (low-density 
lipoprotein) cholesterol while maintaining the "good" 

30 (high-density lipoprotein) cholesterol. (See Mattson, 
et al. r Journal of Lipid Research (1985) 26:194-202). 
The significance of monounsaturated fat in the diet was 
confirmed by international researchers from seven 
countries at the Second Colloquium on Monounsaturated 
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Fats sponsored by the National Heart, Lung and Blood 
Institut s in 1987. 

Soybean oil is also relatively high in 
polyunsaturated fatty acids — at levels far in xcess 

i 

5 of essential dietary requirements. These fatty acids I 
oxidize readily to give off-flavors and reduce the p 
performance of unprocessed soybean oil. The stability 
and flavor of soybean oil is improved by hydrogenation, r 
which chemically reduces the double bonds. However, 

10 this processing reduces the economic attractiveness of 
soybean oil. 

A soybean oil low in total saturates and 
polyunsaturates and high in monounsaturate would provide 
significant health benefits to human consumers as well L 

15 as economic benefit to oil processors. Such soybean j= 
varieties will also produce valuable meal for use as 
animal feed. 

Another type of differentiated soybean oil is an 
edible fat for confectionary uses. More than two 

20 billion pounds of cocoa butter, the most expensive 
edible oil, are produced worldwide. The U.S. imports 
several hundred million dollars worth of cocoa butter 
annually. The high and volatile prices and uncertain 
supply of cocoa butter have encouraged the development i- 

25 of cocoa butter substitutes. The fatty acid composition 1 
of cocoa butter is 26% palmitic, 34% stearic, 35% oleic ! 
and 3% linoleic acids. Cocoa butter's unique fatty acid 
composition and distribution on the triglyceride 
molecule confer on it properties eminently suitable for 

30 confectionary end-uses: it is brittle below 27°C and 
depending on its crystalline state, melts sharply at 
25-30°C or 35-36°C. Consequently, it is hard and non- 
greasy at ordinary temperatures and melts very sharply j 
in the mouth. It is also extremely resistant to 

35 rancidity. For these reasons, a soybean oil with 
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increased levels of palmitic and st aric acids , 
especially in soybean lin s containing reduced levels of 
unsaturated fatty acids , is xpect d to provide a cocoa 
butter substitute in soybean. This will add value to 
5 oil and food processors as well as reduce the foreign 
import of certain tropical oils. 

The partial purification of acyl-ACP thioesterase 
was reported from saf flower seeds (McKeon et al., (1982) 
J. Biol. Chem. 257:12141-12147). However, this 
10 purification scheme was not useful for soybean, either 
because the thioesterases are different of because of 
the presence of other proteins such as the soybean seed 
storage proteins in seed extracts. 

SPMMftRY OF THE INVENTION 
15 A method to alter the levels of saturated and 

unsaturated fatty acids in edible plant oils has been 
invented. Isolated soybean seed acyl-ACP thioesterase 
cDNAs for either the precursor or enzyme were used to 
create chimeric genes. Transformation of plants with 
20 the chimeric genes alters the fatty acid composition of 
the seed oil. 

The invention is nucleic acid fragments comprising 
a nucleotide sequence encoding a plant acyl-ACP 
thioesterase. More specifically, the fragment may be 
25 isolated from soybean, oil producing Brassica species, 
Cuphea viscosissima or Cuphea lanceolate One fragment 
of the invention corresponds to nucleotides 1 to 1602 of 
SEQ ID N0:l f or any nucleic acid fragment substantially 
homologous therewith. Another fragment corresponds to 
30 nucleotides 1 to 1476 of SEQ ID N0:3, or any nucleic 
fragment substantially homologous therewith, more 
preferred nucleic acid fragments are nucleotides 106 to 
1206 of SEQ ID NO:l and nucleotides 117 to 1217 of SEQ 
ID NO: 3, or any nucleic acid fragment substantially 
35 homologous therewith for soybean seed acyl-ACP 
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thioesterase precursor. Also more preferred nucleic 
acid fragments are nucleotides 271 to 1206 of SEQ ID 
NO:l and nucleotides 282 to 1217 of SEQ ID NO:3 r or any 
nucleic acid fragment substantially homologous therewith 
5 for mature soybean seed acyl-ACP thioesterase. 

Another aspect of this invention is a chimeric gene 
capable of transforming a plant cell comprising a 
nucleic acid fragment encoding soybean seed acyl-ACP 
thioesterase cDNA operably linked to suitable regulatory 

10 sequences such that expression of the gene causes 
altered levels of acyl-ACP thioesterase in the seed. 
Preferred are those chimeric genes which incorporate 
nucleic acid fragments encoding soybean seed acyl-ACP 
thioesterase precursor or mature soybean seed acyl-ACP 

15 thioesterase enzyme. 

A further aspect of this invention is a plant 
transformed with the chimeric genes discribed above. 

Yet another embodiment of the invention is a method 
to produce seed oil containing altered levels of 

20 saturated and unsaturated fatty acids comprising: . 
(a) transforming a plant cell with a chimeric gene 
described above , (b) growing sexually mature plants from 
the transformed plant cells, (c) screening progeny seeds 
from the sexually mature plants of step (b) for the 

25 desired levels of palmitic and stearic acid, and (d) 
processing the progeny seed to obtain oil containing 
altered levels of palmitic and stearic acid. Preferred 
plant cells and oils are soybean, oil producing Brassies 
species r sunflower, cotton, cocoa, peanut, safflower, 

30 and corn. 

The invention is also embodied in a method of RFLP 
breeding to obtain altered levels of palmitic, stearic, 
and oleic acids in seed oil. This method comprises: 
(a) making a cross between two soybean varieties 

35 differing in the trait, (b) making a Southern blot of 



restriction enzyme-digested genomic DNA isolated from 
sev ral progeny plants resulting from the cross of st p 
(a) ; and (c) hybridizing the Southern blot with the 
radiolabeled nucl ic acid fragments described herein. 

ESIEE nESCRTPT THN OF THE SEQUENCE DESCRIPTIONS 

The invention can be more fully understood from the 
following detailed description and the Sequence 
Descriptions which form a part of this application. The 
Sequence Descriptions contain the three letter codes for 
amino acids as defined in 37 C.F.R. 1.822 which are 
incorporated herein by reference. The nucleotide 
sequences read from 5 f to 3 1 . 

SEQ ID N0:1 shows the 1602 nucleotides of a soybean 
seed acyl-ACP thioesterase cDNA. 

SEQ ID NO: 2 shows the amino acid sequence of the 
precursor protein of a soybean seed acyl-ACP 
thioesterase (the coding sequence of SEQ ID NO:l) . 

SEQ ID NO: 3 shows the 1476 nucleotides of a soybean 
seed acyl-ACP thioesterase cDNA. 

SEQ ID NO: 4 shows the amino acid sequence of the 
precursor protein of a soybean seed acyl-ACP 
thioesterase (the coding sequence of SEQ ID NO: 3) . 

SEQ ID NOs:5 and 6 show sequences related to the N- 
terminal sequence of acyl-ACP thioesterase. 

SEQ ID NOs:7, 8 and 9 show respectively a protein 
sequence, DNA sequence and the related hybridization 
probe . 

SEQ ID NOs:10, 11 and 12 show respectively a 
protein sequencer DNA sequence and the related 
hybridization probe. 

SEQ ID NO: 13 shows the sequence of the sequencing 
primer used to identify soybean acyl-ACP thioesterase 
isozymers . 

SEQ ID NOs:14, 15, 16 and 17 show sequences chosen 
from SEQ ID NO:l as probes for identification of 
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acyl-ACP thioesterase genes from- the £. viscosissima 
£. lanceolata genomes. 

SEQ ID NO: 18 shows a PCR primer corresponding to 
bases 83 through 117 in SEQ ID N0:1. j 
5 SEQ ID NO: 19 shows a PCR primer corresponding to \? 

bases 274 through 296 in SEQ ID N0:1. f 

SEQ ID NO: 20 shows an 1378 base pair, partial 
genomic clone of acyl-ACP thioesteraswe from £. x»BU&. 
SEQ ID NO: 21 shows an 865 base pair insert 
10 sequenced from £. yiseosissima. 

SEQ ID NO: 22 shows an 852 base pair insert 
sequenced from £. lanceolata. 

amiLSD DESCRIPTION OF THE INVENTION j_ 
The present invention describes two isolated j_ 
15 nucleic acid fragments that encode soybean seed acyl-ACP f 
thioesterases . These enzymes catalyze the hydrolytic 
cleavings of palmitic acid f stearic acid and oleic acid L 
from ACP in the respective acyl-ACPs. 

Only recently have serious efforts been made to 
20 improve the quality of soybean oil through plant 

breeding, especially mutagenesis. A wide range of 
fatty acid compositions have been discovered in 
experimental lines of soybean (Table 1) . Findings from 
work on various oil crops suggest that the fatty acid |_ 
25 composition of soybean oil can be significantly altered j 
without affecting the agronomic performance of a soybean ! 
plant. However, there is no soybean mutant line with 
levels of saturates less than those present in 
commercial canola, the major competitor to soybean oil 
30 as a "healthy" oil. 
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TABLE 1 
Range of Fatty Acid 

Percentages Produced bv Soybean Mutants 



5 



Fatty acids 
Palmitic Acid 
Stearic Acid 



Range of t 



6-28 



3-30 



Oleic Acid 
Linoleic Acid 
Linolenic Acid 



17-50 



35-60 



10 



3-12 



There are serious drawbacks to using mutagenesis to 
alter fatty acid composition. It is unlikely to 
discover mutations a) that result in a dominant ("gain- 

15 of-function") phenotype f b) in genes that are essential 
for plant growth r and c) in an enzyme that is not rate- 
limiting and that is encoded by more than one gene. 
Even when some of the desired mutations are available in 
soybean mutant lines their introgression into elite 

20 lines by traditional breeding techniques will be slow 
and expensive, since the desired oil compositions in 
soybean are most likely to involve several recessive 
genes . 



25 offer the potential for overcoming some of the 

limitations of the mutagenesis approach, including the 
need for extensive breeding. Particularly useful 
technologies are: a) seed-specific expression of foreign 
genes in transgenic plants (see Goldberg et al., (1989) 

30 Cell 56:149-160), b) use of antisense RNA to inhibit 
plant target genes in a dominant and tissue-specific 
manner (see van der Krol et al., (1988) Gene 72:45-50), 
c) use of homologous transgenes to suppress native gene 
expression (see Napoli et al., (1990) The Plant Cell 

35 2:279-289; van der Krol et al., (1990) The Plant Cell 



Recent molecular and cellular biology techniques 
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2:291-299; Smith et al., (1990) Mol. Gen. Genetics 
224:477-481), d) transfer of foreign genes into elite 
commercial varieties of commercial oilcrops, such as 
soybean (Chee et al. (1989) Plant Physiol. 91:1212-1218; 
5 Christou et al., (1989) Proc. Natl. Acad. Sci. U.S.A. 
86:7500-7504; Hinchee et al., (1988) Bio/Technology 
6:915-922; EPO publication 0 301 749 A2), rapeseed (De 
Block et al., (1989) Plant Physiol. 91:694-701], and 
sunflower (Everett et al., ,(1987) Bio/Technology 

10 5:1201-1204), and e) use of genes as restriction 
fragment length polymorphism (RFLP) markers in a 
breeding program, which makes introgression of recessive 
traits into elite lines rapid and less expensive 
(Tanksley et al. (1989) Bio/Technology 7:257-264). 

15 However, each of these technologies requires 

identification and isolation of commercially-important 
genes . 

Oil biosynthesis in plants has been fairly well- 
studied (see Harwood (1989) in Critical Reviews in Plant 

20 Sciences, Vol. 8 (l):l-43). The biosynthesis of 

palmitic, stearic and oleic acids occurs in the plastids 
of plant cells by the interplay of three key enzymes of 
the "ACP track": palmitoyl-ACP elongase, stearoyl-ACP 
desaturase and acyl-ACP thioesterase. 

25 Of these three enzymes, acyl-ACP thioesterase 

removes the acyl chain from the carrier protein (ACP) 
and thus from the metabolic pathway. The same enzyme, 
with slightly differing efficiency, catalyzes the 
hydrolysis of the palmitoyl, stearoyl and oleoyl-ACP 

30 thioesters. This multiple activity leads to substrate 
competition between enzymes and it is the competition of 
acyl-ACP thioesterase and palmitoyl-ACP elongase for the 
same substrate and of acyl-ACP thioesterase and 
stearoyl-ACP desaturase for the same substrate that 
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leads to the production of a particular ratio of 
palmitic, stearic and oleic acids. 

Once removed from the ACP track by the action of 
acyl-ACP thioesterase, fatty acids are exported to the 
5 cytoplasm and there used to synthesize acyl-coenzyme A 
(CoA) . These acyl-CoA's are the acyl donors for at 
least three different glycerol acylating enzymes 
(glycerol-3-P acyltransf erase, l-acyl-glycerol-3-P 
acyltransf erase and diacylglycerol acyltransf erase) 

10 which incorporate the acyl moieties into 
triacylglycerides during oil biosynthesis. 

These acyltransferases show a strong, but not 
absolute, preference for incorporating saturated fatty 
acids at positions 1 and 3 and monounsaturated fatty 

15 acid at position 2 of the triglyceride. Thus, altering 
the fatty acid composition of the acyl pool will drive 
by mass action a corresponding change in the fatty acid 
composition of the oil. Furthermore, there is 
experimental evidence that, because of this specificity, 

20 given the correct composition of fatty acids, plants can 
produce cocoa butter substitutes (Bafor et al. r (1990) 
J. Amer. Oil Chemists Soc. 67:217-225). 

Based on the above discussion, one approach to 
altering the levels of palmitic, stearic and oleic acids 

25 in vegetable oils is by altering their levels in the 
cytoplasmic acyl-CoA pool used for oil biosynthesis. 

It should be possible to genetically modulate the 
competition both between palmitoyl-ACP elongase and 
acyl-ACP thioesterase and between stearoyl-ACP 

30 desaturase and thioesterase by modulating the expression 
level- of thioesterase. While alteration of stearoyl-ACP 
desaturase activity either upward or downward may change 
the existing ratio of oleaterstearate and similarly 
altered expression of palmitoyl-ACP elongase might lead 

35 to new palmitate: (stearate + oleate) ratios, only 
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modification of the acyl-ACP thioesterase activity 
expected to change the amounts of both palmitate i 
stearate with one g netic manipulation ♦ Increas c 
competition leading to increas d levels of palmiti 
5 stearic acids would result from over-expression oi 
cloned and re-introduced thioesterase genes which 
more proven technology, while decreased competitic 
leading to decreased total saturated fatty acid wc 
result from expression of antisense message from t 

10 acyl-ACP thioesterase gene. The simultaneous and 
opposite manipulation of the palmitoyl-ACP elongas 
stearoyl-ACP desaturase activities would be reguir 
achieve these same effects. There are thus two 
advantages to the use of nucleotide sequences enco 

15 the acyl-ACP thioesterase to increase saturated fa 
acid content in vegetable oil over the manipulatio 
the other two mentioned enzymes: 1) the manipulat 
does not require antisense technology and 2) both 1 
palmitate and stearate levels should be elevated w 

20 one genetic manipulation. 

Transfer of one or both of these nucleic acid 
fragments of SEQ ID NOs:l and 3 of the invention o: 
part thereof that encodes a functional enzyme, wit] 
suitable regulatory sequences into a living cell w: 

25 result in the production or over-production of acyi 
thioesterase, which may result in increased levels 
palmitic and stearic acids in cellular lipids, incJ 
oil. 

Transfer of the nucleic acid fragment or fragi 
30 of the invention, with suitable regulatory sequence 
that transcribe the present cDNA, into a plant havi 
endogenous seed acyl-ACP thioesterase substantially 
homologous with the present cDNA may inhibit by 
cosuppression the expression of the endogenous acyl 
35 thioesterase gene and, consequently, result in a 
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decreased amount of palmitic and stearic acids in the 
seed oil (Jorgenson f Trends Biotech. (1990) 340-344) . 

Transfer of the nucleic acid fragment or fragments 
of the invention into a soybean plant with suitable 
5 regulatory sequences that transcribe the antisense RNA f~ 
complementary to the mRNA, or its precursor , for seed 
acyl-ACP thioesterase may inhibit the expression of the 
endogenous acyl-ACP thioesterase gene and, consequently, 
result in reduced amounts of palmitic and stearic acids 
10 in the seed oil. 

The nucleic acid fragments of the invention can 
also be used as restriction fragment length polymorphism 
(RFLP) markers in soybean genetic studies and breeding 
programs . 

15 DEFINITIONS f 

In the context of this disclosure, a number of 
terms shall be utilized. As used herein, the term 
"nucleic acid" refers to a large molecule which can be 
single stranded or double stranded, composed of monomers 

20 (nucleotides) containing a sugar, phosphate and either a 
purine or pyrimidine. A "nucleic acid fragment" is a 
fraction of a given nucleic acid molecule. In higher 
plants, deoxyribonucleic acid (DNA) is the genetic 

material while ribonucleic acid (RNA) is involved in the i_ 
25 transfer of the information in DNA into proteins. A j 
"genome" is the entire body of genetic material * 
contained in each cell of an organism. The term 
"nucleotide sequence" refers to a polymer of DNA or RNA 
which can be single- or double-stranded, optionally 
30 containing synthetic, non-natural or altered nucleotide 
bases capable of incorporation into DNA or RNA polymers. 
As used herein, the term "homologous to" refers to the 
complementarity between the nucleotide sequence of two 
nucleic acid molecules or between the amino acid 
35 sequences of two protein molecules. Estimates of such 
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homology ar provided by either DNA-DNA or DNAHRNA 
hybridization under conditions of stringency as is well 
understood by those skilled in the art (as describ d in 
Hames and Higgins < ds.) Nucleic Acid Hybridisation, IRL 
5 Press, Oxford, U.K. (1985)); or by the comparison of 

sequence similarity between two nucleic acids or I 
proteins. As used herein, "substantially homologous" 
refers to nucleic acid molecules which require less L. 
stringent conditions of hybridization than those for 

10 homologous sequences, and coding DNA sequence which may 
involve base changes that do not cause a change in the 
encoded amino acid, or which involve base changes which 
may alter an amino acid, but not affect the functional 
properties of the protein encoded by the DNA sequence. T 

15 Thus, the nucleic acid fragments described herein p 

include molecules which comprise possible variations of 
the nucleotide bases derived from deletion, 
rearrangement, random or controlled mutagenesis of the 
nucleic acid fragment, and even occasional nucleotide 

20 sequencing errors so long as the DNA sequences are 
substantially homologous. 

"Gene" refers to a nucleic acid fragment that 
expresses a specific protein, including regulatory 
sequences preceding (5 1 non-coding) and following (3 1 ^ 

25 non-coding) the coding region. "Acyl-ACP thioesterase 

gene" refers to a nucleic acid fragment that expresses a [ 
protein with acyl-ACP thioesterase activity. "Native" 
gene refers to the gene as found in nature with its own 
regulatory sequences. "Chimeric" gene refers to a gene 

30 that is comprised of. heterogeneous regulatory and coding 
sequences. "Endogenous" gene refers to- the native gene 
normally found in its natural location in the genome. A 
"foreign" gene refers to a gene not normally found in 

the host organism but that is introduced by gene '■ 

1 

35 transfer. 
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"Coding sequence" refers to a DNA sequence that 
codes for a sp cific protein and xcludes the non-coding 
sequences. It may constitute an "unint rrupt d coding 
sequence", i.e., lacking an intron, such as in a cDNA or 
5 it may include one or more introns bounded by 

appropriate splice junctions. An "intron" is a sequence ,■ 
of RNA which is transcribed in the primary transcript 
but which is removed through cleavage and re-ligation of 
the RNA within the cell to create the mature mRNA that 
10 can be translated into a protein. 

"Initiation codon" and "termination codon" refer to 
a unit of three adjacent nucleotides in a coding 
sequence that specifies initiation and chain 
termination, respectively, of protein synthesis (mRNA 
15 translation) . "Open reading frame" refers to the amino 
acid sequence encoded between translation initiation and 
termination codons of a coding sequence. 

"RNA transcript" refers to the product resulting 
from RNA polymerase-catalyzed transcription of a DNA 
20 sequence. When the RNA transcript is a perfect 

complementary copy of the DNA sequence, it is referred 
to as the primary transcript or it may be a RNA sequence 
derived from posttranscriptional processing of the 
primary transcript and is referred to as the mature RNA. 
25 "Messenger RNA" (mRNA) refers to the RNA that is without 
introns and that can be translated into protein by the 
cell. "cDNA" refers to a double-stranded DNA that is 
complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that include the mRNA. 
30 "Antisense RNA" refers to a RNA transcript that is 
complementary to all or part of a target primary 
transcript or mRNA and that blocks the expression of a 
target gene by interfering with the processing, 
transport and/or translation of its primary transcript 
35 or mRNA. The complementarity of an antisense RNA may be 



15 

with any part of the specific gene transcript, i.e., at 
the 5 1 non-coding sequence, 3' non-coding sequence, 
introns, or the coding sequence. In addition, as used 
herein, antisense RNA may contain regions of ribozyme 
5 sequences that may increase the efficacy of antisense 

RNA to block gene expression. "Ribozyme 1 ' refers to a j 

catalytic RNA and includes sequence-specific 

endoribonucleases. 

As used herein, "suitable regulatory sequences" 

10 refer to nucleotide sequences located upstream (5 f )r 
within, and/or downstream (3 1 ) to a coding sequence, 
which control the transcription and/or expression of the 
coding sequences, potentially in conjunction with the 
protein biosynthetic apparatus of the cell. In 

15 artificial DNA constructs regulatory sequences can also j 
control the transcription and stability of antisense 
RNA. 

"Promoter" refers to a DNA sequence in a gene, 
usually upstream (5 f ) to its coding sequence, which 

20 controls the expression of the coding sequence by 

providing the recognition for RNA polymerase and other 
factors required for proper transcription. In 
artificial DNA constructs promoters can also be used to 
transcribe antisense RNA. Promoters may also contain i 

25 DNA sequences that are involved in the binding of 1 
protein factors which control the effectiveness of * 
transcription initiation in response to physiological or 
developmental conditions. It may also contain enhancer 
elements. An "enhancer" is a DNA sequence which can 

30 stimulate promoter activity. It may be an innate 
element of the promoter or a heterologous element 
inserted to enhance the level and/or tissue-specificity 
of a promoter. "Constitutive promoters" refers to those 
that direct gene expression in all tissues and at all ; 

35 times. "Tissue-specific" or "development-specific" ' 
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promot rs as referr d to h rein are those that direct 
gene xpression almost exclusively in specific tissues, 
such as leaves or seeds, or at specific development 
stages in a tissue, such as in early or late 
5 embryogenesis, respectively. p 
The term "expression", as used herein, is intended j- 
to mean the production of a functional end-product. 
Expression or overexpression of the acyl-ACP 
thioesterase genes involves transcription of the gene 

10 and translation of the mRNA into precursor or mature 
acyl-ACP thioesterase proteins. "Antisense inhibition 
refers to the production of antisense RNA transcripts 
capable of preventing the expression of the target j= 
protein. "Overexpression" refers to the production of a 

15 gene product in transgenic organisms that exceeds levels 
of production in normal or non-transformed organisms. 
"Cosuppression" refers to the expression of a transgene 
which has substantial homology to an endogenous gene 
resulting in the suppression of expression of both the 

20 ectopic and the endogenous gene. 

"Altered expression" refers to the production of 
gene product (s) in transgenic organisms in amounts or 
proportions that differ significantly from that activity 
in comparable tissue (organ and of developmental type) 

25 from wild-type organisms. 

The "3* non-coding sequences" refers to the DNA 
sequence portion of a gene that contains a 
polyadenylation signal and any other regulatory signal 
capable of affecting mRNA processing or gene expression. 

30 The polyadenylation signal is usually characterized by 
affecting the addition of polyadenylic acid tracts to 
the 3' end of the mRNA precursor. 

"Mature" protein refers to a functional acyl-ACP 
thioesterase enzyme without its transit peptide. 

35 "Precursor" protein refers to the mature protein with a 



native or foreign transit peptide. "Transit 11 peptide 
refers to the amino terminal extension of a polypeptide, 
which is translated in conjunction with the polyp ptide 
forming a precursor peptide and which is required for 
its uptake by plastids of a cell. 

"Transformation" herein refers to the transfer of a 
foreign gene into the genome of a host organism and its 
genetically stable inheritance. "Restriction fragment 
length polymorphism" refers to different sized 
restriction fragment lengths due to altered nucleotide 
sequences in or around variant forms of genes. 
"Fertile" refers to plants that are able to propagate 
sexually. 

"Oil producing species" herein refers to plant 
species which produce and store triacylglycerol in 
specific organs, primarily in seeds. Such species 
include soybean, canola, sunflower, cotton, cocoa, 
peanut, saf flower and corn. The group also includes 
non-agronomic species which are useful in developing 
appropriate expression vectors such as tobacco and 
ArafridPPSis thaliana, and wild species which may be a 
source of unique fatty acids. 

Purification of Soybean Seed Acvl-ACP IhjoafitfiEafig 
In order to modulate the activity of acyl-ACP 
thioesterase in the seed, it is essential to isolate or 
purify the complete gene(s) or cDNA(s) encoding the 
target enzyme (s). 

Acyl-ACP thioesterase proteins were purified to a 
protein mixture containing either two or three peptides 
when analyzed by SDS polyacrylamide gel electrophoresis 
(SDS-PAGE) starting from the soluble fraction of 
extracts made from developing soybean seeds following 
binding to DEAE-cellulose, ammonium sulfate 
precipitation, chromatographic separation on blue 
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sepharose, high performance anion exchange, alkyl-ACP 
sepharose, and phenyl-Superose. In a typical 
preparation, the fold purification of thio sterase 
activity was about 8500, The preparation runs as a 
5 single band in native polyacrylamide gel 

electrophoresis, and as a single, symmetrical peak in 
gel filtration chromatography indicating a native 
molecular weight of about 75 kD. SDS-PAGE of these 
preparations showed a very minor peptide of about 39 kD 

10 and two major peptides at about 33 and 34 kD. 

Polyclonal antibodies raised to each of these 
peptides individually in mice cross-reacted in all 
combinations upon Western analysis indicating that the 
peptides are antigenically very similar. All attempts 

15 at separating these three peptides with retention of 

thioesterase activity failed. The peptides at 33 and 34 
kD could be separated from the 39 kD peptide by reverse 
phase chromatography on a diphenyl matrix. The mixture 
containing these two peptides was analyzed by N-terminal 

20 amino acid sequencing and one main sequence of the 
following amino acid order was found: 

Arg-Val-Glu-Ala-Pro-Gly-Gly-Thr-Leu-Ala-Asp-Arg-Leu 
(SEQ ID NO: 5). 

25 

These results lead to the conclusion that the 
native thioesterase enzyme is a heterodimer of at least 
three polypeptides with similar amino acid sequences and 
nearly identical N-termini. Whether the mixture arises 
30 as the result of the expression of slightly dissimilar 
genes or as the result of heterogenous proteolytic 
processing at the carboxyl terminal of the product of 
one gene of or identical genes is not known. 
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Cloning of finvh^an Seed Acvl-ACP Thioest.erase CDNA 
The combined 33 and 34 kD peptides from reversed 
phase purification were denatured and reduced with 
dithiothr itol (DTT) , then alkylated with vinyl 
5 pyridine. The derivatized protein was desalted, 
lyophilized and subjected to CNBr cleavage in 70% 
trifluoroacetic acid (TFA) solution. Peptide fragments 
produced by CNBr were separated by SDS-PAGE 
electophoresis, elctrophoretically transferred onto 
10 Iramobilon®-P membrane and stained with non-acid 

Coomassie blue. Three main peptides of 28, 16 and 14 kD 
were observed and cut from the blot for N-terminal 
sequencing. Of these, the peptide of 14 kD gave the 
following amino acid sequence from its N-terminal: 

15 

Ile-Glu-Ile-Tyr-Lys-Tyr-Pro-Ala-Trp-Leu-Asp-Ile-Val-Glu-Ile 
(SEQ XD NO: 6). 

Based on this sequence from the first lie to the 
20 first two bases of the codon for the last lie, a set of 
eight degenerate 41 nucleotide-long oligonucleotides was 
synthesized. The design took into account the codon 
usage in selected soybean seed genes and used six 
deoxyinosines at positions of ambiguity. The probe, 
25 following radiolabeling, was used to screen a cDNA 

expression library made in Lambda Zap vector from polyA + 
RNA from 20-day-old developing soybean seeds. Five 
positively hybridizing plaques were subjected to plaque 
purification. Sequences of the pBluescript (Stratagene) 
30 vector, including the cDNA inserts, from each of the 

purified phage stocks were excised in the presence of a 
helper phage and the resultant phagemids used to infect 
fi. cells resulting in double-stranded plasmids, 

p22A f p22B f p23A, p25A, and p25B. 



WO 92/11373 



20 



PCT/US91/0910U 



10 



The cDNA insert in plasmid p22B is flanked at both 
ends by the two EcoRI sites introduced by the cDNA 
construction and its cloning into the vector 
pBluescript. The nucleotide sequ nee of the cDNA insert 
in p22B encodes a 367 amino acid open reading frame that 
includes the N-terminal sequence found in the purified 
protein at the fifty-sixth amino acid of the open 
reading frame. Thus the first fifty-five amino acids 
are presumably the transit peptide required for import 
of the precursor protein into the plastid. The 
methionine codon at base number 106 of p22B is the 
apparent start methionine since a) it is the first 
methionine after the last stop codons 5' to and inframe 
with the N-terminal sequence and, b> the N-terminal 
15 methionine in all but one known chloroplast transit 
peptides is followed by alanine. Thus, it can be 
deduced that the acyl-ACP thioesterase precursor protein 
encoded by this gene consists of a fifty-five amino acid 
transitpeptide and a 312 amino acid mature protein 
20 before any further proteolytic processing occurs. A 
fusion protein comprising the first sixteen amino acids 
of p-galactosidase, and beginning at the fourth amino 
acid of the mature soybean seed acyl-ACP thioesterase in 
an appropriate plasmid is expressed in £. sail and is 
25 catalytically functional. 

The entire cDNA insert in p22B was cut from the 
Bluescript plasmid, radiolabeled and used as a probe for 
additional thioesterase genes in the soybean seed cDNA 
library. Five additional clones were characterized. Of 
30 these, one is identical to clone 22B from one hundred 
bases before the stop codon of the open reading frame 
and through the 3' non-coding region. The other four 
appear to be identical to each other, but differ from 
22b. One of these, clones <4C) was sequenced completely 
35 and is shown in SEQ ID NO: 3. The open reading frame on 
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the cDNA encodes a thioesterase precursor protein which 
is again 367 amino acids in length and which, at the 
amino acid 1 v 1, is 97% identical to the thio sterase 
encoded by insert 22B. Both the S % and 3 1 non-coding 
5 sequences of the two genes diverge in identity as the 
distance from the open reading frame increases. 

The fragments of the instant invention may be used, 
if desired, to isolate substantially homologous acyl-ACP 
thioesterase cDNAs and genes, including those from plant 

10 species other than soybean. Isolation of homologous 

genes is well-known in the art. Southern blot analysis 
reveals that the soybean cDNA for the enzyme hybridizes 
to several, different-sized DNA fragments in the genomic 
DNA of tomato, rapeseed (prassica naPUS) / soybean, 

15 sunflower and Arabidopsls (which has a very simple 
genome) . Although the number of different genes or 
"pseudogenes" (non-functional genes) present in any 
plant is unknown, it is expected to be more than one 
since acyl-ACP thioesterase is an important enzyme. 

20 Moreover, plants that are amphidiploid (that is, derived 
from two progenitor species), such as soybean, rapeseed 
(£. papus ) . and tobacco will have genes from both 
progenitor species. 

25 flrarexoresston of the Enzyme in Transgenic Species 

The nucleic acid fragment of the instant invention 
encoding soybean seed acyl-ACP thioesterase cDNA, or a 
coding sequence derived from other cDNAs or genes for 
the enzyme, with suitable regulatory sequences, can be 

30 used to overexpress the enzyme in transgenic soybean as 
well as other transgenic species. Such a recombinant 
DNA construct may include either the native acyl-ACP 
thioesterase gene or a chimeric gene. One skilled in 
the art can isolate the coding sequences from the 

35 fragment of the invention by using and/or creating sites 
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for restriction endonucleas s, as described Sambrook 
et al. Mol cular Cloning: A Laboratory Manual, 2nd Ed. 
(1989), Cold Spring Harbor Laboratory Pr ss. Of 
particular utility are sites for Nco I (5'-CCATGG-3'> 
and Sph I (5'-GCATGC-3«) that allow precise removal of 
coding sequences starting with the initiating codon ATG. 
For isolating the coding sequence of acyl-ACP 
thioesterase precursor from the fragment of invention, 
an Nco I site can be engineered by substituting 
nucleotide A at position 105 with C. Cutting at this 
engineered site (or alternatively at an existing Hind 
III (5'-AAGCTT-3') site beginning at base pair 93 of 
p22B) along with cuts at restriction endonuclease sites 
near the 3' end of p22B such as the Spi I at 1339 or the 
Xmn I site at 1562 allows removal of the fragment 
encoding the acyl-ACP thioesterase precursor protein and 
directional re-insertion into a properly designed 
vector . 

Inhibition of Plant Target 

ffrnfff fry nse of aafciaeaafi rna 

Antisense RNA has been used to inhibit plant target 
genes in a dominant and tissue-specific manner (see 
van der Krol et al., Gene (1988) 72:45-50; Ecker et al., 
Proc. Natl. Acad. Sci. USA (1986) 83:5372-5376; 
van der Krol et al., Nature (1988) 336:866-869; Smith 
et al., Nature (1988) 334:724-726; Sheehy et al., Proc. 
Natl. Acad. Sci. USA (1988) 85:8805-8809; Rothstein 
et al., Proc. Natl. Acad. Sci. USA (1987) 84:8439-8443; 
Cornelissen et al., Nucl. Acids Res. (1988) 17:833-843; 
Cornelissen, Nucl. Acid Res. .(1989) 17:7203-7209; Robert 
et al., Plant Mol. Biol. (1989) 13:399-409; Cannon 
et al., Plant Molec. Biol. (1990) 15:39-47). 

The use of antisense inhibition of the seed enzyme 
would require isolation of the coding sequence for genes 
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that are expr ssed in the target tissue of the target 
plant. Thus, it will be more useful to use the fragment 
of the inv ntion to screen seed-specific cDNA libraries, 
rather than genomic libraries or cDNA libraries from 
5 other tissues, from the appropriate plant for such \ 
sequences. Moreover, since there may be more than one P 
gene encoding seed acyl-ACP thioest erase, it may be 
useful to isolate the coding sequences from the other 
genes from the appropriate crop. The genes that are 

10 most highly expressed are the best targets for antisense 
inhibition. The level of transcription of different 
genes can be studied by known techniques, such as 
nuclear run-off transcription. 

There have been examples of using the entire cDNA j_ 

15 sequence for antisense inhibition (Sheehy et al., Proc. U 
Natl. Acad. Sci. USA (1988) 85:8805-8809). Thus, for 
expressing antisense RNA in soybean seed from the 
fragment of the invention, the entire fragment of the 
invention (that is, the entire cDNA for soybean seed 

20 acyl-ACP thioesterase within the restriction sites 

described above) may be used. There is also evidence 

that the 3 f non-coding sequences can play an important 

role in antisense inhibition (Ch'ng et al., Proc. Natl. 

Acad. Sci. USA (1989) B6: 10006-10010) or short fragments i~ 

25 of 5' coding sequence (as few as 41 base-pairs of a 1.87 

Jcb cDNA) (Cannon et al., Plant Molec. Biol. (1990) ■ 
15:39-47). Thus, for expressing antisense RNA in 
soybean seed from the fragment of the invention, a small 
fragment of the invention, consisting of at least 41 

30 base pairs of the acyl-ACP thioesterase cDNA, may also 
be used. 
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Inhibition of Plant Target 
Genes bv Cosuppression 

The phenomenon of cosuppression has also been used 
to inhibit plant target genes in a dominant and tissue- 
5 specific mann r (Napoli et al., The Plant C 11 (1990) 
2:279-289; van der Krol et al. r The Plant Cell (1990) 
2:291-299; Smith et al., Mol. Gen. Genetics (1990) 224: 
477-481) • The nucleic acid fragment of the instant 
invention encoding soybean seed acyl-ACP thioesterase 
10 cDNA f or a coding sequence derived from other cDNAs or 
genes for the enzyme, along with suitable regulatory 
sequences, can be used to reduce the level of the enzyme 
in a transgenic oilseed plant which contains an 
endogenous gene substantially homologous to the 
15 introduced acyl-ACP thioesterase cDNA. The experimental 
procedures necessary for this are similar to those 
described above for sense overexpression of the 
acyl-ACP thioesterase cDNA. Cosuppressive inhibition of 
an endogenous gene using the entire cDNA sequence 
20 (Napoli et al., The Plant Cell (1990) 2:279-289; van der 
Krol et al. f The Plant Cell (1990) 2:291-299) and also 
using part of a gene (730 bp of a 1770 bp cDNA) (Smith 
et al., Mol. Gen. Genetics (1990) 224:477-481) are 
known. Thus, all or part of the nucleotide sequence of 
25 the present acyl-ACP thioesterase cDNA may be used to 
reduce the levels of acyl-ACP thioesterase enzyme in a 
transgenic oilseed. 

Selection of Hosts, Promoters and Enhancers 
30 A preferred class of heterologous hosts for the 

expression of the coding sequence of acyl-ACP 
thioesterase precursor or the antisense RNA are 
eukaryotic hosts, particularly the cells of higher 
plants. Particularly preferred among the higher plants 
35 are the oilcrops, such as soybean ( Glycine max ) . 
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rape seed f Brassica nanus . £. caiflPSStris) t sunflower 
f ltelianthus annus ) ,. cotton {Gossvpium hirS^tWR) r corn 
(JSfiflL mavs ) , cocoa fTheobroma , and peanut (ftrachis 

hvpoaaea ) . Expression in plants will use regulatory 
5 sequences functional in such plants. 

The expression of foreign genes in plants is well- 
established <De Blaere et al., Meth. Enzymol. (1987) 
153:277-291). The origin of promoter chosen to drive l 
the expression of the coding sequence or the antisense 

10 RNA is not critical provided it has sufficient 

transcriptional activity to accomplish the invention by 
increasing or decreasing, respectively, the level of 
translatable mRNA for acyl-ACP thioesterase in the 
•desired host tissue. Preferred promoters include (a) [ 

15 strong constitutive plant promoters, such as those j: 
directing the 19S and 35S transcripts in Cauliflower 
mosaic virus (Odell et al., Nature (1985) 313:810-812; L 
Hull et al., Virology (1987) 86:482-493), and (b) 
tissue- or developmentally-specif ic promoters. Examples 

20 of tissue-specific promoters are the light-inducible 
promoter of the small subunit of ribulose 1,5-bis- 
phosphate carboxylase if expression is desired in 
photosynthetic tissues, mai2e zein protein (Matzke 
et al., EMBO J. (1984) 3:1525), and chlorophyll a/b 

25 binding protein (Lampa et al., Nature (1986) 316:750- 
752) . 

Particularly preferred promoters are those that 
allow seed-specific expression. This may be especially 
useful since seeds are the primary source of vegetable 

30 oils and also since seed-specific expression will avoid 
any potential deleterious effect in non-seed tissues. 
Examples of seed-specific promoters include, but are not 
limited to, the promoters of seed storage proteins, 
which can represent up to 90% of total seed protein in 

35 many plants. The seed storage proteins are strictly 
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regulated, being expr ss d almost exclusively in seeds 
in a highly tissue-specific and stage-specific manner 
(Higgins et al., Ann. Rev. Plant Physiol. (1984) 
35:191-221; Goldberg et al., Cell (1989) 56:149-160). 
Moreover, different seed storage proteins may be 
expressed at different stages of seed development. 

Expression of seed-specific genes has been studied 
in great detail (See reviews by Goldberg et al., Cell 
(1989) 56:149-160 and Higgins et al., Ann. Rev. Plant 
Physiol. (1984) 35:191-221). There are currently 
numerous examples for seed-specific expression of seed 
storage protein genes in transgenic dicotyledonous 
plants. These include genes from dicotyledonous plants 
for bean (J-phaseolin (Sengupta-Gopalan et al., Proc. 
Natl. Acad. Sci. USA (1985) 82:3320-3324; Hoffman 
et al.. Plant Mol. Biol. (1988) 11:717-729), bean lectin 
(Voelker et al., EMBO J. (1987) 6:3571-3577), soybean 
lectin (Okamuro et al., Proc. Natl. Acad. Sci. USA 

(1986) 83:8240-8244), soybean Kunitz trypsin inhibitor 
(Perez-Grau et al., Plant Cell (1989) 1:095-1109), 
soybean p-conglycinin (Beachy et al., EMBO J. (1985) 
4:3047-3053; pea vicilin (Higgins et al., Plant Mol. 
Biol. (1988) 11:683-695), pea convicilin (Newbigin 

et al., Planta (1990) 180:461), pea legumin (Shirsat 
et al., Mol. Gen. Genetics (1989) 215:326); rapeseed 
napin (Radke et al., Theor. Appl. Genet. (1988) 75:685- 
694) as well as genes from monocotyledonous plants such 
as for maize 15 kD zein (Hoffman et al., EMBO J. (1987) 
6:3213-3221), maize 18 kD oleosin (Lee at al., Proc. 
Natl.. Acad. Sci. USA (1991) 888:6181-6185), barley 
p-hordein (Marris et al., Plant Mol. Biol. (1988) 
10:359-366) and wheat glutenin (Colot et al., EMBO J. 

(1987) 6:3559-3564). Moreover, promoters of seed- 
specific genes operably linked to heterologous coding 
sequences in chimeric gene constructs also maintain 
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their temporal and spatial expression pattern in 
transgenic plants. Such examples includ fry^bidops^ s 
thaliana 2S s ed storage prot in gene promoter to 
xpress enkephalin peptides in Arabidopsis and £. nflEius. 
5 seeds (Vandekerckhove et al., Bio/Technology (1989) 
7: 929-932) , bean lectin and bean p-phaseolin promoters 

to express lucif erase (Riggs et al., Plant Sci. (1989) 
63:47-57), and wheat glutenin promoters to express 
chloramphenicol acetyl transferase (Colot et al., EMBO 

10 J. (1987) 6:3559-3564). 

Of particular use in the expression of the nucleic 
acid fragment of the invention will be the heterologous 
promoters from several soybean seed storage protein 
genes such as those for the Kunitz trypsin inhibitor 

15 (Jofuku et al., Plant Cell (1989) 1:1079-1093; glycinin 
(Nielson et al. r Plant Cell (1989) 1:313-328), and 
P-conglycinin (Harada et al., Plant Cell (1989) 
1:415-425). Promoters of genes for a- and p-subunits of 
soybean p-conglycinin storage protein will be 

20 particularly useful in expressing the mRNA or the 

antisense RNA to acyl-ACP thioesterase in the cotyledons 
at mid- to late-stages of seed development (Beachy et 
al. , EMBO J. (1985) 4:3047-3053 in transgenic plants. 
This is because there is very little position effect on 

25 their expression in transgenic seeds, and the two 
promoters show different temporal regulation. The 
promoter for the a-subunit gene being expressed a few 
days before that for the p-subunit gene. This is 
important for transforming rapeseed where oil 

30 biosynthesis begins about a week before seed storage 
protein synthesis (Murphy et al.,. J. Plant Physiol. 
(1989) 135:63-69) . 

Also of particular use will be promoters of genes 
expressed during early embryogenesis and oil 

35 biosynthesis. The native regulatory sequences, 
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including the native promoter, of the acyl-ACP 
thioesterase gene expressing the nucleic acid fragment 
of the invention can be used following its isolation by 
those skilled in the art. Heterologous promoters from 
5 other genes involved in seed oil biosynthesis, such as g 
those for £. napus isocitrate lyase and malate synthase f 
(Comai et al., Plant Cell (1989) 1:293-300), Arfib i dgpsis 
ACP (Post-Beittenmiller et al., Nucl. Acids Res. (1989) 
17:1777), fi. naous ACP (Safford et al., Eur. J. Biochem. 

10 (1988) 174:287-295), £. saiDBSSillS. ACP (Rose et al., 
Nucl. Acids Res. (1987) 15:7197), and Sea IHBYS oleosin 
(Lee et al., Proc. Natl. Acad. Sci. USA (1991) 88:6181- 
6185) may also be used. The genomic DNA sequence for B_. 
napus oleosin is also published (Lee et al.. Plant £ 

15 Physiol. (1991) 96:1395-1397) and one skilled in the art f 
can use this sequence to isolate the corresponding 
promoter. The partial protein sequences for the 
relatively-abundant enoyl-ACP reductase and acetyl-CoA 
carboxylase are published (Slabas et al., Biochim. 

20 Biophys. Acta (1987) 877:271-280; Cottingham et al., 
Biochim. Biophys. Acta (1988) 954:201-207) and one 
skilled in the art can use these sequences to isolate 
the corresponding seed genes with their promoters. 

Attaining the proper level of expression of acyl- |_ 

25 ACP thioesterase mRNA or antisense KNA may require the \ 
use of different chimeric genes utilizing different \ 
promoters- Such chimeric genes can be transfered into 
host plants either together in a single expression 
vector or sequentially using more than one vector. 

30 It is envisioned that the introduction of enhancers 

or enhancer-like elements into either the native acyl- 
ACP thioesterase promoter or into other promoter 
constructs will also provide increased levels of primary 
transcription for antisense KNA or in RNA for acyl-ACP j 

35 thioesterase to accomplish the inventions. This would 
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include viral enhancers such as that found in the 35S 
promoter (Odell t al., Plant Mol. Biol. (1988) 10:263- 
272) , enhancers from the opine genes (Fromm et al.> 
Plant C 11 (1989) 1: 977-984) , or enhancers from any 
5 other source that result in increased transcription when |- 
placed into a promoter operably linked to the nucleic p 
acid fragment of the invention. 

Of particular importance is the DNA sequence 
element isolated from the gene for the a-subunit of 

10 p-conglycinin that can confer 40-fold seed-specific 
enhancement to a constitutive promoter (Chen et al. r 
Dev. Genet. (1989) 10:112-122). One skilled in the art 
can readily isolate this element and insert it within 
the promoter region of any gene in order to obtain seed- 

15 specific enhanced expression with the promoter in ^ 

transgenic plants. Insertion of such an element in any 

seed-specific gene that is expressed at different times 
than the P-conglycinin gene will result in expression in 

transgenic plants for a longer period during seed 

20 development. 

The invention can also be accomplished by a variety 
of other methods to obtain the desired end. In one 
form, the invention is based on modifying plants to 
produce increased levels of acyl-ACP thioesterase by L 

25 virtue of having significantly larger numbers of copies ] 
of the acyl-ACP thioesterase gene product. This may [ 
result in sufficient increases in acyl-ACP thioesterase 
activity levels to accomplish the invention. 

Any 3 1 non-coding region capable of providing a 

30 polyadenylation signal and other regulatory sequences 
that may be required for the proper expression of the 
acyl-ACP thioesterase coding region can be used to 
accomplish the invention. This would include the native 
3 Vend of the substantially homologous soybean acyl-ACP ] 

35 thioesterase gene(s), the 3' end from a heterologous 
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acyl-ACP thioesterase, the 3' end from viral genes such 
as the 3 f end of the 35S or the 19S cauliflower mosaic 
virus transcripts/ the 3* end from the opine synthesis 
genes, the 3 1 ends of ribulose 1, 5-bisphosphate 
5 carboxylase or chlorophyll a/b binding protein, or 3 f 
end sequences from any source such that the sequence 
employed provides the necessary regulatory information 
within its nucleic acid sequence to result in the proper 
expression of the promoter/acyl-ACP thioesterase coding 
10 region combination to which it is operably linked. 

There are numerous examples in the art that teach the 
usefulness of different 3' non-coding regions* 

Transformation Methods 
15 Various methods of transforming cells of higher 

plants according to the present invention are available 
to those skilled in the art (see EPO Pub. 0 295 959 A2 
and 0 318 341 Al) . Such methods include those based on 
transformation vectors based on the Ti and Ri plasmids 
20 of flgrofracterium SEE- It is particularly preferred to 
use the binary type of these vectors. Ti-derived 
vectors transform a wide variety of higher plants, 
including monocotyledonous and dicotyledonous plants, 
Sukhapinda et al,, Plant Mol. Biol. (1987) 8:209-216; 
25 Potrykus, Mol. Gen. Genet. (1985) 199:183). Other 

transformation methods are available to those skilled in 
the art, such as direct uptake of foreign DNA constructs 
(see EPO Pub. 0 295 959 A2), techniques of 
electroporation (Fromm et al., Nature (1986) (London) 
30 319:791) or high-velocity ballistic bombardment with 
metal particles coated with the nucleic acid constructs 
(Kline et al., Nature (1987) (London) 327:70). Once 
transformed, the cells can be regenerated by those 
skilled in the art. 
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Of particular relevance are the recently describ d 
methods to transform foreign genes into commercially 
important crops, such as rapeseed (De Block et al., 
Plant Physiol. (19B9) 91:694-701), sunflower (Everett 
5 et al., Bio/Technology (1987) 5:1201) , and soybean 
(Christou et al., Proc. Natl. Acad. Sci USA (1989) 
86:7500-7504) and corn (Fromm et al., (1990) 
Bio/technology 8:833-839). 

10 Application to RFLP Technology 

The use of restriction fragment length polymorphism 
(RFLP) markers in plant breeding has been well- 
documented in the art (Tanksley et al., Bio/Technology 
(1989) 7:257-264). The nucleic acid fragment of the 

15 invention indicates two gene copies by Southern 

blotting. Both of these have been mapped on a soybean 
RFLP map (Tingey et al., J. Cell Biochem. (1990), 
Supplement 14E p. 291, abstract R153) and can be used as 
RFLP markers for traits linked to these mapped loci. 

20 These traits will include altered levels of palmitic, 
stearic and oleic acid. The nucleic acid fragment of 
the invention can also be used to isolate the acyl-ACP 
thioesterase gene from variant (including mutant) 
soybeans with altered stearic acid levels. Sequencing 

25 of these genes will reveal nucleotide differences from 
the normal gene that cause the variation. Short 
oligonucleotides designed around these differences may 
be used as hybridization probes to follow the variation 
in stearic f palmitic and oleic acids. Oligonucleotides 

30 based on differences that are linked to the variation 
may be used as molecular markers in breeding these 
variant oil traits. 
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TpnT.ATTON CUP rttNA FOR 

p^YFFifiT 3 SEEP. *™,-*n> THTOBSTERASE 
PREPARATION OF RADIOLABELED PALMITOYL, STEAROYL AND 
5 OLEOYL-ACP 

?nr1f1r n f<"" Carxi&E Protrln fftCP) f ran . b ^jqII 

To frozen £. eoli cell paste, (0.5 kg of 1/2 log 
phase growth of £. eoli B grown on minimal media and 

10 obtained from Grain Processing Corp, Muscatine, IA) was 
added 50 mL of a solution 1 M in Tris, 1 M in glycine, 
and 0.25 M in EDTA. Ten mL of 1 M MgCl 2 was added and 
the suspension was thawed in a water bath at 50°C. As 
the suspension approached 37°C it was transferred to a 

15 37°C bath, made to 10 mM in 2-mercaptoethanol and 20 mg 
of DNAse and 50 mg of lysozyme were added. The 
suspension was stirred for 2 h, then sheared by three 20 
sec bursts in a Waring Blendor. The volume was adjusted 
to 1 L and the mixture was centrifuged at 24,000xg for 

20 30 min. The resultant supernatant was centrifuged at 
90,000xg for 2h. The resultant high-speed pellet was 
saved for extraction of acyl-ACP synthase (see below) 
and the supernatant was adjusted to pH 6.1 by the 
addition of acetic acid. The extract was then made to 

25 50% in 2-propanol by the slow addition of cold 
2-propanol to the stirred solution at 0°C. The 
resulting precipitate was allowed to settle for 2 h and 
then removed by centrifugation at 16,000xg. The 
resultant supernatant was adjusted to pH 6.8 with KOH 

30 and applied at 2 mL/min to a 4.4 x 12 cm column of DEAE- 
Sephacel which had been equilibrated in 10 mM MES, pH 
6.8. The column was washed with 10 mM MES, pH 6.8 and 
eluted with 1 L of a gradient of LiCl from 0 to 1.7 M in 
the same buffer. Twenty mL fractions were collected and 

35 the location of eluted ACP was determined by applying 
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10 flL of every second fraction to a lane of a native 
polyacrylamide (20% acrylamide) gel electrophoresis 
(PAGE) . Fractions luting at about 0.7 M LiCl contained 
nearly pure ACP and w re combined, dialyzed overnight 
5 against water and then lyophilized. 

Purification of Acyl-ftCP Synthase 

Membrane pellets resulting from the high-speed 
centrifugation described above were homogenized in 380 

10 mL of 50 mM Tris-Cl, pH 8,0, and 0.5 M in NaCl and then 
centrifuged at 80,000xg for 90 min. The resultant 
supernatant was discarded and the pellets resuspended in 
50 mM Tris-Cl, pH 8,0, to a protein concentration of 12 
mg/mL. The membrane suspension was made to 2% in Triton 

15 X-100 and 10 mM in MgCl2, and stirred at 0°C for 20 min 
before centrifugation at 80,000xg for 90 min. The 
protein in the resultant supernatant was diluted to 
5 mg/mL with 2% Triton X-100 in 50 mM Tris-Cl, pH 8.0 
and f then, made to 5 mM ATP by the addition of solid ATP 

20 (disodium salt) along with an equimolar amount of 

NaHC03. The solution was warmed in a 55°C bath until 
the internal temperature reached 53°C and was then 
maintained at between 53°C and 55°C for 5 min. After 
5 min the solution was rapidly cooled on ice and 

25 centrifuged at 15,000xg for 15 min. The supernatant 

from the heat treatment step was loaded directly onto a 
column of 7 mL Blue Sepharose 4B which had been 
equilibrated in 50 mM Tris-Cl, pH 8.0, and 2% Triton 
X-100. The column was washed with 5 volumes of the 

30 loading buffer, then 5 volumes of 0.6 M NaCl in the same 
buffer and the activity was eluted with 0.5 M KSCN in 
the same buffer. Active fractions were assayed for the 
synthesis of acyl-ACP, as described below, combined, and 
bound to 3 mL sett led- volume of hydroxy lapat it e 

35 equilibrated in 50 mM Tris-Cl, pH 8.0, 2% Triton X-100. 
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The bydroxylapatite was collected by centrifugation, 
washed twice with 20 mL of 50 mM Tris-Cl, pH 8.0, 2% 
Triton X-100. The activity was luted with two 5 mL 
washes of 0.5 M potassium phosphate, pH 7.5, 2% Triton 
5 X-100. The first wash contained 66% of the activity and 
it was concentrated with a 30 kD membrane filtration 
concentrator (Amicon) to 1.5 mL. 

Synthesis ffi£ Radiolabeled ftOVl-ftCP 

10 A solution of [ 3 H] palmitic acid, [ 14 C] -stearic 

acid and ["C] -oleic acid (120 nmol each) prepared in 
methanol were dried in a glass reaction vial. The ACP 
preparation described above (1.15 mL, 32 nmol) was added 
along with 0.1 mL of 0.1 M ATP, 0.05 mL of 80 mM DTT, 

15 0.1 mL of 8 M LiCl, and 0.2 mL of 13% Triton X-100 in 
0.5 M Tris-Cl, pH 8.0, with 0.1 M MgCl 2 . The reaction 
was mixed thoroughly and 0.3 mL of the acyl-ACP synthase 
preparation was added and the reaction was incubated at 
37°C. After 0.5 h intervals a 10 \IL aliquot was taken 

20 and dried on a small filter paper disc. The disc was 
washed extensively with chloroform: methanol: acetic acid 
(8:2:1, v:v:v) and radioactivity retained on the disc 
was taken as a measure of stearoyl-ACP . At 2 h about 
88% of the ACP had been consumed. The reaction mixes 

25 were diluted 1 to 4 with 20 mM Tris-Cl, pH 8.0, and 
applied to 1 mL DEAE-Sephacel columns equilibrated in 
the same buffer. The columns were washed in sequence 
with 5 mL of 20 mM Tris-Cl, pH 8.0, 5 mL of 80% 2- 
propanol in 20 mM Tris-Cl, pH 8.0, and eluted with 0.5 M 

30 LiCl in 20 mM Tris-Cl, pH 8.0. The column eluates were 
passed directly onto 3 mL columns of octyl-sepharose CL- 
4B which were washed with 10 mL of 20 mM potassium 
phosphate, pH 6.8, and then eluted with 35% 2-propanol 
in 2 mM potassium phosphate, pH 6.8. The eluted 
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products were lyophilized and redissolved at a 
concentration of 24 ^M. 

PREPARATION OF ALKYL-ACP AFFINITY COLUMN 

s 
■ 

5 1 

Synthesis nf N-hexadeRvliodoacetamide j 

2 

1-Hexadecylamine (3.67 inmol) was dissolved in 14.8 
mL of CH2Cl2r cooled to 4°C, and 2.83 mmol of iodoacetic 
anhydride in 11.3 mL of CH2CI2 was added dropwise to the 

10 stirred solution. The solution was warmed to room 

temperature and held for 2 h. The reaction mixture was 
diluted to about 50 mL with CH2CI2 and washed 3 times 
(25 mL) with saturated sodium bicarbonate solution and 
then 2 times with water. The volume of the solution was 

15 reduced to about 5 mL under vacuum and passed through 25 | 
mL of silica in diethyl ether. The eluate was reduced 
to an off-white powder under vacuum. This yielded 820 
mg (2.03 xnmol) of the N-hexadecyliodoacetamide (71.8% 
yield) . The 300 MHz !h NMR spectra of the product was 

20 consistent with the expected structure. 

Synthesis of N-Hexadecvlacetamido-S-ACP 

£. coli ACP prepared as above (10 mg in 2 mL of 
50 mM Tris-Cl, pH 7.6) was treated at 37°C with 50 roM . 

25 DTT for 2 h. The solution was made to 10% 

trichloroacetic acid (TCA) , held at 0°C for 20 min and I 
centrifuged to pellet. The resultant pellet was washed 
(2x2 mL) with 0.1 M citrate, pH 4.2 and redissolved in 
3 mL of 50 mM potassium phosphate buffer. The pH of the 

30 ACP solution was adjusted to 7.5 with 1 M KOH and 3 mL 
of N-hexadecyliodoacetamide (3 mM in 2-propanol) was 
added. A slight precipitate of the N-hexadecyliodo- 
acetamide was redissolved by warming the reaction mix to 
45°C. The mixture was held a 45°C for 6 h. SDS-PAGE on j 

35 20% acrylamide PAGE gel showed approximately 80% 
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conversion to an ACP sped s of intermediate mobility 
between the start ing, reduced ACP and authentic 
palmitoyl-ACP . Excess N-h xadecyliodo acetamide was 
removed from the reaction mix by 4 xtractions (3 mL) 
5 with CH2CI2 with gentle mixing to avoid precipitation of 
the protein at the interface. 

Coupling of N-Hexadecylacetamido-S-ACP 
to CNBr-activated Sephargse 4B 

10 Cyanogen bromide -activated Sepharose 4B (Pharmacia, 

2 g) was suspended in 1 mM HC1 and extensively washed by 
filtration and resuspension in 1 mM HC1 and finally one 
wash in 0.1 M NaHC03 f pH 8.3. The N-hexadecyl- 
acetamido-S-ACP prepared above was diluted with an equal 

15 volume of 0.2 M NaHC03, pH 8.3. The filtered cyanogen 
bromide-activated Sepharose 4B (about 5 mL) was added to 
the N-hexadecylacetamido-S-ACP solution f the mixture was 
made to a volume of 10 mL with the 0.1 M NaHCC>3, pH 8.3, 
and mixed by tumbling at room temperature for 6 h. 

20 Protein remaining in solution (Bradford assay) indicated 
approximately 85% binding. The gel suspension was 
collected by centrifugation, washed once with the 0,1 M 
NaHCOs, pH 8.3, and resuspended in 0.1 M ethanolamine 
adjusted to pH 8.5 with HC1. The suspension was allowed 

25 to stand at 4°C overnight and then washed by 

centrifugation and re-suspension in 12 mL of 0.1 M 
acetate, pH 4.0, 0.5 M in NaCl and then 0.1 M NaHCC>3, pH 
8.3, 0.5 M in NaCl. The alkyl-ACP Sepharose 4B was 
packed into a 1 x 5.5 cm column and washed extensively 

30 with 20 mM bis-tris propane-Cl (BTP-C1) , pH 7.2, before 
use. 
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ACYL-fiCP THIQESTERftSE ASSAY 
Acyl-ACP thioesterase was assayed as described by 
McKeon et al., (J. Biol. Chem. (1982) 257:12141-12147). 
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Each of the radiolabeled acyl-ACP's were adjusted to 
concentrations ranging from 0.05 JIM to 1.5 JIM and a 
volume of 25 J1L with a reaction buffer consisting of 
1 mg/mL bovine serum albumin in 0.1 M Tricine buffer at 
5 pH 8.2. Reactions were start d with 5 JIL of soybean j, 
seed extract containing acyl-ACP thioesterase activity j§ 
and incubated for times varying from 12 sec to 5 min 
depending upon the activity of the fraction. Reactions ^ 
were terminated by the addition of 100 \IL of a solution 
.10 of 5% acetic acid in 2-propanol and extracted twice with 
1 mL each of water saturated hexane. Five mL of 
ScintiVerse Bio HP (Fisher) scintillation fluid was 
added to the combined extracts and radioactivity in the 
released fatty acids was determined by scintillation 
15 counting. fl 
For routine assays during acyl-ACP thioesterase 
purification t 14 C] stearoyl-ACP at a concentration of 0.6 
(iM was used in the assay as described above. 

20 PTTRTFICMIOM QE SflSBEAH ACYL-ACP THIOESTERASE 

Developing soybean seeds ( Glycine max cultivar 
Wye), ca. 20-25 days after flowering, were harvested and 
stored at -80° until use. One kg of the seeds were 
added while frozen to 2 L of a buffer consisting of 50 

I** 

25 .mM TRIS/HC1 pH 8.0, 2 wM DTT and 0.2mM EDTA in a Waring | 
blendor and ground until thawed and homogenized. The i 
homogenate was centrifuged at 14,000xg for 20 min, 
decanted and the supernatant was centrifuged at 35,000xg 
for 45 min. The resulting high speed supernatant was 

30 adjusted to 55% saturation with ammonium sulfate at 4° 
and protein was allowed to precipitate for 30 min before 
centrifugation at 14,000xg for 15 min to remove 
precipitated proteins. The precipitated was dissolved 
in 50 mM BTP-HCl buffer, pH 7.2, 1 mM in 2-mercapto- j 

35 ethanol and dialyzed overnight against 15 L of the same 



WO 92/11373 



38 



buffer at 5 mM. The dialyzed ammonium sulfate fraction 
was adjusted to a buffer concentration of 20 raM, a 
protein concentration of 5 mg/mL and Triton X-100 was 
added to a final concentration of 0.02%. One third of 
5 the resulting solution was applied to a 250 mL column of 
Blue sepharose contained in a radial flow column. The 
flow rate was approximately 75 mL/min and the column 
wash washed with the application buffer until the 
absorbance at 280 nm monitored at the column efflux 

10 returned to zero after application of the protein. 

Acyl-ACP thioesterase activity was eluted with 1 M NaCl 
in the same buffer and the column was washed with an 
additional three column volumes of the salt containing 
buffer before re-equilibration with six column volumes 

15 of the starting buffer. This procedure was repeated 
twice more to bind and elute all of the acyl-ACF 
thioesterase activity present in the 55* ammonuim 
sulfate fraction. 

The combined Blue sepharose eluates were brought to 

20 85% saturation in ammonium sulfate at 4°, allowed to 

precipitate for 30 min, then centrifuged to at 20 f 000xg 
for 20 min. The resulting pellet was redissolved in 
20 mM TRIS-HC1, pH 7.4, 0.2 mM in EDTA and 1 mM in DTT 
then dialyzed overnight against 4 L of the same buffer. 

25 The dialysate was centrifuged at 22,000xg for 20 min 
then applied at a flow rate of 5 mL/min to Mono Q HR 
16/10 anion exchange column (Pharmacia) equlibrated in 
the same buffer. After application of the protein, the 
column was washed with the same buffer until the 

30 absorbance at 280 nm monitored at the column efflux 
returned to near zero. The loaded column was re- 
equilibrated to pH 8.5 with 20 mM TRIS-HC1, and after 
the pH monitored at the column efflux was stable at that 
pH, elution was begun with the following program: NaCl • 

35 concentration in the TRIS buffer system was increased 
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linearly from 0 to 0.1 M over 10 min r then held at 0.1 M 
for 10 min. The NaCl concentration was then increased 
lin arly from 0.1 M to 0.3 M over 80 min. The acyl-ACP 
thioest rase activity eluted broadly from an NaCl 
5 concentration of 0.165 M to 0.275 M. Active fractions 
were pooled, precipitated with ammonium sulfate as after 
Blue sepharose elution f redissolved in 20 mM BTP-HC1 at 
pH 7.2 and dialyzed overnight against 2 L of the same 
buffer at 5 mM. After dialysis, the Mono Q fraction was 

10 adjusted to 20 mM BTP-HCl and 0.02% Triton X-100 before 
application to the alkyl-ACP affinity column. The 
column was loaded at 1 mL/min, then washed with the 
application buffer until the absorbance at 280 nm 
monitored at the column efflux returned to zero. The 

15 column was then washed with 0.1 M NaCl in the same 

buffer until a protein peak was washed from the column 
and the column efflux 280 nm absorbance returned to zero 
before elution of the acyl-ACP thioesterase activity 
with 1 M NaCl in the BTP-HC1 buffer system. 

20 The eluant from the alkyl-ACP column was made to 

1 M in ammonium sulfate and applied at at flow rate of 
0.5 mL/min to a Phenyl Superose HR 5/5 column 
(Pharmacia) which was equlibrated with 1 M ammonium 
sulfate in 50 mM potassium phosphate buffer at pH 7.0. 

25 After sample application, the column was washed with 
equilibration buffer until the absorbance at 280 nm 
returned to zero and then eluted with a 20 mL gradient 
from 1 M ammonium sulfate in the potassium phosphate 
buffer to the buffer alone. 

30 Acyl-ACP thioesterase containing fractions from the 

Phenyl Superose column contained from 400 to 600 Jig of 
protein and were enriched in specific activity of the 
acyl-ACP thioesterase by from 8,500 to 10 f 500 fold 
depending upon the preparation. Gel filtration ' 

35 chromatography of the Phenyl Superose purified acyl-ACP 
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thioesterase on an UltroPac TSK G200 SW (0.75x60 cm, 
Pharmacia) eluted with 0.1 M potassium phosphate buffer 
at 1 mL/min gave one major protein peak which also 
corresponded with the acyl-ACP thioesterase activity. 
5 The molecular size estimation of that peak was 

approximately 75 kD. Analysis of the peptides present 
in the gel filtration purified acyl-ACP thioesterase 
showed three peptides of 39, 34 and 33 kD in size. The 
peptide at 39 kD was always least abundant and was not 
10 clearly visible in some preparations. Of the 34 and 33 
kD peptides, the abundance of the 34 kD peptide slightly 
exceeds that of the 33 . Further separation of these 
three peptides with retention of thioesterase activity 
has not been possible. 

15 

antibnriv P rAffi'nitation nf Soybean Seed Acvl-ACP 

Ihiaaatfiraafi 

Acyl-ACP thioesterase purified through the Phenyl 
Superose step was denatured with DTT and SDS applied to 

20 a gradient polyacrylamide gel (9 to 15% acrylamide) and 
subjected to SDS electrophoresis. The developed gel was 
stained with a 9:1 mixture of 0.1% Coomassie blue in 50% 
methanol to 0.5% Serva blue in 50% methanol then 
partially destained with 3% glycerol in 20% methanol. 

25 The peptide doublet at 33 and 34 kD was cut from the 

gel, frozen in liquid nitrogen, then ground to a powder 
and suspended in 50 mM sodium phosphate buffer. The 
suspended gel with protein was sent for antibody 
production in New Zealand White rabbit by Hazelton 

30 Research Products Inc. Denver, PA. Serum obtained after 
three injections of the combined 33 and 34 kD peptides 
identified those peptides in Western analysis, but also 
cross-reacted with the much less abundant peptide at 
39 kD which was not included in the antigen 

35 preparations. The anti-33 and 34 kD serum was purified 
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by immune affinity chromatography. Approximately one mg 
of acyl-ACP thioesterase purifi d through the Phenyl 
Superose step of the purification sequenc described 
above was bound to CNBr activat d sepharose (Pharmacia) 
5 according to the manufacturer's instructions. Five mL L 
of the antiserum was equilibrated in 10 mM potassium i 
phosphate buffer (pH 7.4) by gel filtration, mixed with \ 
the antigen bound to sepharose and allowed to bind 
overnight at 4°. The sepharose was poured into a small 
10 column and washed with 5 column volumes of the phosphate 
buffer then eluted with 0.1 M glycine (pH 2.5). 
Fractions of 0.9 mL were collected in tubes containing 
0.05 mL of 2 M TRIS and 1 mg of bovine serum albumin. 
Fractions containing the anti-33 and 34 JcD peptide f 
15 immunoglobin were identified by using each fraction as 
the antibody in Western analysis. Active fractions were 
pooled and concentrated to approximately 50 \LL by 
membrane concentration. 

Soybean seed acyl-ACP thioesterase was purified 
20 through the Mono-Q anion exchange step described in the 
scheme above. Fold purification over the starting 
extract was about 60. Ten \IL of this preparation was 
added to 2 fiL of 0.1 M TRIS/glycine buffer (pH 8.0) 
which contained from 0 to 2 fiL of the purified antibody 
25 preparation. The solution was incubated for 45 min at 

room temperature, then 20 \IL of Protein A-sepharose I 
(Sigma) was added and the mixture was incubated an 
additional 30 min. The Protein A-sepharose was removed 
by centrifugation and 3 \il of the supernatant was taken 
30 for the standard acyl-ACP thioesterase assay. Pre- 

immune serum from the rabbit was diluted 1 to 10 in the 
incubation mix with the acyl-ACP thioesterase 
preparation, incubated and treated with Protein 
A-sepharose as above for a control.' Net activity of the 

S 
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10 



15 



acyl-ACP thioesterase preparation after treatment with 
various dilutions of the antibody are shown below: 

TABLE 2 



r\ .* i -inn r\f ant*ibodv 


Net nmol/liL/min 


1 -t-o 1000 


3.46 




3.60 




3.85 


1 to 50 




1 to 25 


0.90 


1 to 16.6 


0.29 


1 to 12.5 


0.26 


1 to 10 


0.30 


1 to 5 


0 


Pre-immune 1 to 10 


3.46 


No antibody 


4.07 



f 



The acyl-ACP thioesterase can thus be precipitated 
by the anti-33 and 34 kD antiserum, indicating that 
20 these two peptides are either all or part of the soybean 
seed thioesterase enzyme. 

N-Terminal and Internal Amino Acid Sequence from the 

Acyl-ACP -thioesterase _ — ~ t 

25 ..Acyl-ACP thioesterase purified through the Phenyl j 

Superose step of the standard scheme was purified by * 
reversed-phase chromatography to remove the small amount 
of the 39 kD peptide and a trace of lower molecular 
weight contaminant. One hundred fig of the preparation 

30 in 1 ibL total volume was made to 0.1% trifluoroacetic 
acid (TFA) and loaded at 0.2 mL/min onto a Vydek 
diphenyl reversed phase column. The column was washed 
for 20 min with 0.1% TFA, then eluted by stepping to 25% j 
acetonitrile in 0.1% TFA, washing for 8 min then eluting j 

35 with a gradient from 25 to 70% acetonitrile in 0.1% TFA. 
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The 33 and 34 kD peptides eluted together at 35.5% 
acetonitrile. 

The combined peptides in the reverse phase purified 
fraction w r us d to determine the N-terminal amino 
5 acid sequence on an Applied Biosystems 470A Gas Phase jj_ 
Sequencer. PTH amino acids were analyzed on an Applied £ 
Biosystems 120 PTH amino Acid Analyzer. The N-terminal " f 
sequence was determined to be: 

10 R-V-E-A-P-G-G-T-L-A-D-R-L (SEQ ID NO: 5). 

Other residues were present in most cycles, most 
notably the P in cycle 5 and the G in cycle 6. 

Internal fragments of the combined peptides were f= 

15 also generated by CNBr cleavage. Acyl-ACP thioesterase t 1 
purified through the Phenyl Superose step in the 
purification scheme (400 \lg in 290 was denatured by 
the addition of 24 \IL of *M TRIS at pH 8.0, 15 mg of 
DTT, 31 JIL of 0.5 M EDTA and solid guanidine-HCl to make 

20 the solution 6 M in guanidine. The solution was 
incubated at room temperature for 2.5 h before the 
addition of 33 JiL of 4-vinyl-pyridine and then incubated 
an additional 4 h. The solution was desalted by 
dilution to 2.5 ml and passage through a sephadex G-25 

25 column which had been equilibrated in 2 mM TRIS, pH 8.0. j 
The solution was lyophilized, redissolved in 400 jiL of I 
70% TFA, placed in a sealable flask then evacuated and 
flushed with N2. CNBr (2 mg in 2 \IL of 70% TFA) was 
added and the flask was again evacuated and flushed with 

30 N2. After incubation for 20 h in the dark at room 

temperature , the reaction mixture was diluted to 4 mL 
with water and again lyophilized. The residue was 
dissolved in water and approximately 200 \ig (on the 
basis of the starting protein) was precipitated with 10% 

35 trichloroacetic acid (TCA) . The resulting pellet was * 
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removed by centrifugation, then washed in sequence with 
acetone, 1% TCA and acetone again. The washed pellet 
was dissolved in 100 JIL of 1% SDS with 7% glycerol and 
loaded onto a 20% crosslinked polyacrylamide gel for 
5 electrophoresis. The developed gel was 

electrophoretically blotted onto Immobilon membrane 
(Millipore), stained with 0.5% coomassie blue in 50% 
methanol and destained with 50% methanol. Three 
prominent bands of about 28 kD, 16 kD and 14 kD were cut 

10 from the Immobilon, and the N-terminal sequence of each 
was determined by gas phase sequencing as described 
above. With the exceptions of the 5th, 6th and 8th 
cycles, the sequence of the 28 kD fragment was identical 
to the N-terminal of the non-CNBr treated protein 

15 although other residues were present in all cycles. 
Nine cycles of sequence were obtained from the 16 kD 
band and 16 from the 14 kD band. The first nine cycles 
were identical for the two peptides, and the common 
sequence obtained for the fragments is as follows: 

20 

j-E-I-y-K-Y-P-A-W-L-D-I-V-E-I (SEQ ID NO: 6). 

n - nim ^ ssybjaa seed acvI-acp TH nest-erase cPPft 
Based on the N-terminal sequence from cycle 2 
25 through 11, a set of 64 degenerate 29 nucleotide-long 
probes were designed for use as a hybridization probe: 

(SEQ ID N0:7) 

PROTEIN SEQUENCE : VEAPGGTLAD 

30 

(SEQ ID NO: 8) 

DNA SEQUENCE: S'-GTT GAA GCN CCA GGA GGN ACN TTT GCA GA 
G G T T C G T 



WO 92/11373 PCT/US91/09160 

45 

(SEQ ID NO: 9) 

PROBE: 5'-GTT GAA GCI CCA GGI GGI ACI TTT GCA GA 

G G T C G T 

5 The design took into account the codon bias in L 

representative soybean seed genes encoding Bowman-Birk p 
protease inhibitor (Hammond et al., J. Biol. Chem. 
(1984) 259:9883-9890), glycinin subunit A-2B-la (Utsumi 
et al., Agric. Biol. Chem. (1987) 51:3267-3273), lectin 

10 (le-1) (Vodkin et al,, Cell (1983) 34:1023-1031), and 
lipoxygenase-1 (Shibata et al., J. Biol. Chem. (1987) 
262:10080-10085). Four deoxyinosines were used at 
selected positions of ambiguity. 

A cDNA library was made as follows: Soybean F 

15 embryos (ca. 50 mg fresh weight each) were removed from ^ 
the pods and frozen in liquid nitrogen. The frozen 
embryos were ground to a fine powder in the presence of 
liquid nitrogen and then extracted by Polytron 
homogenization and fractionated to enrich for total RNA 

20 by the method of Chirgwin et al. (Biochemist ry, (1979) 
18:5294-5299). The nucleic acid fraction was enriched 
for polyA+ RNA by passing total RNA through an oligo-dT 
cellulose column and eluting the polyA+ RNA by salt as 
described by Goodman et al. (Meth. Enzymol. (1979) 

25 68:75-90) . cDNA was synthesized from the purified I 
polyA+ RNA using cDNA Synthesis System (Bethesda \ 
Research Laboratory) and the manufacturer's 
instructions. The resultant double-stranded DNA was 
methylated by DNA methylase (Promega) prior to filling- 

30 in its ends with T4 DNA polymerase (Bethesda Research 
Laboratory) and blunt-end ligating to phosphorylated 
EcoRI linkers using T4 DNA ligase (Pharmacia) .. The 
double-stranded DNA was digested with EcoRI enzyme, 
separated from excess linkers by passing through a gel 

35 filtration column (Sepharose CL-4B) , and ligated to x 
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lambda ZAP vector (Stratagene) as per manufacturer's 
instructions. Ligated DNA was packaged into phag using 
Gigapack packaging extract (Stratagene) according to 
manufacturer's instructions. The r sultant cDNA library 
5 was amplified as per Stratagen 's instructions and 
stored at -80°C. 

Following the instructions in Lambda ZAP Cloning 
Kit Manual (Stratagene) , the cDNA phage library was used 
to infect £. coli BB4 cells and plated to yield ca. 

10 35,000 plaques per petri plate (150 mm diameter). 
Duplicate lifts of the plates were made onto 
nitrocellulose filters (Schleicher & Schuell) . 
Duplicate lifts from five plates were prehybridized in 
25 mL of Hybridization buffer consisting of 6X SSC 

15 (0.9 M NaCl, 0.09 M sodium citrate, pH 7.0), 5X 

Denhardt's [0.5 g Ficoll (Type 400, Pharmacia), 0.5 g 
polyvinylpyrrolidone, 0.5 g bovine serum albumin 
(Fraction V; Sigma)], 1 mM EDTA, 1% SDS, and 100 ug/mL 
denatured salmon sperm DNA (Sigma Chemical Co.) at 45°C 

20 for 10 h. Fifty pmol of the hybridization probe (see 
above) were end-labeled in a 52.5 uL reaction mixture 
containing 50 mM Tris-Cl, pH 7.5, 10 mM MgCl 2 , 0.1 mM 
spermidine-HCl (pH 7.0), 1 mM EDTA (pH 7.0), 5 mM DTT, 
200 HCi (66.7 pmol) of gamma-labeled AT 32 P (New England 

25 Nuclear) and 25 units of T4 polynucleotide kinase (New 
England Biolabs) . After incubation at 37°C for 45 min, 
the reaction was terminated by heating at 68°C for 10 
min. Labeled probe was separated from unincorporated 
AT 32 P by passing the reaction through a Quick-Spin™ (G- 

30 25 Sephadex®) column (Boehringer Mannheim Biochemicals) . 
The purified labeled probe (1.2 x 10 7 dpm/pmol) was 
added to the prehybridized filters, following their 
transfer to 10 mL of fresh Hybridization buffer. 
Following incubation of the filters in the presence of 

35 the probe for 48 h in a shaker at 48°C, the filters were 
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washed in 200 mL of Wash buffer (6X SSC, 0.1% SDS) five 
times for 5 min ach at room temperature, then at 48°C 
for 5 min and finally at 62°C for 5 min. The washed 
filters were air dried and subj cted to autoradiography 
5 on Kodak XAR-2 film in the presence of intensifying 
screens (Lightening Plus, DuPont Cronex®) at -80°C 

overnight. Six positively-hybridizing plaques were 
subjected to plaque purification as described in 
Sambrook et al. (Molecular Cloning, A Laboratory Manual, 

10 2nd ed. (1989), Cold Spring Harbor Laboratory Press). 
None of the potential positives purified to completion 
and all were eventually dropped as false positives. 

A second oligonucleotide probe was constructed 
based on the amino acid sequence derived from the CNBr 

15 fragments at 14 and 16 kD as follows: 

(SEQ ID NO: 10) 

PROTEIN SEQUENCE IEIYKY PAWLDIEI 



20 <SEQ ID N0:11) 

DNA SEQUENCE : 5 1 -ATN GAA ATN TAC AAA TAC CCN GCN TGG CTN GAC ATN GAA ATN 

G TGTT T T G 

(SEQ ID NO: 12) 

25 PROBE: ATI GAA ATI TAT AAA TAT CCI GCI TGG TTI GAT ATI GAA AT 

G G G 

The design is based on the same codon bias 
assumptions as the N-terminal probe described above, 

30 with the additional simplification of eliminating the C 
at all G/C ambiguities. Probe radiolabeling was done as 
described for the N-terminal probe and hybridization of 
nitrocellulose lifts was done similarly, except that the 
hybridization temperature was lowered to 37°, Screening 

35 of five plates with approximately 33,000 plaques each 
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produc d five positives which were then plaque purified. 
Of the five positives , four purified and isolated 
plaques could be taken corresponding to radioactive 
signals on the lifts in the s cond round of 
5 purification. - 
Following the Lambda ZAP Cloning Kit Instruction ~ 
Manual (Stratagene) , sequences of the pBluescript 
vector, including the cDNA inserts, from each of four 
purified phages were excised in the presence of a helper 

10 phage and the resultant phagemids were used to infect 
£. fifili XL-1 Blue cells resulting in double-stranded 
plasmids, p22A, p22B, p25A and p25B. Purity of the 
clones was checked by colony hybridization and a single, ^ 
positive colony from each was used for culture f- 

15 preparation. ^ 
DNA from the plasmids was made by the alkaline 
lysis miniprep procedure described in Sambrook et al. 
(Molecular Cloning, A Laboratory Manual, 2nd Ed, (1989) 
Cold Spring Harbor Laboratory Press). The alkali- 

20 denatured double-stranded DNA from p22B was sequenced 
using Sequenase" T7 DNA polymerase (US Biochemical 
Corp.) and the manufacturers instructions- The 
sequence of the cDNA insert in plasmid p22B is shown in 
SEQ ID NO:l. r 

25 j 

EXAMPLE 2 ■ 

EXPRESSION OF SOYBEAN SEED 

ACYIr-ftCP THIQSSTERASS IN E. CQLI 
Construction of {$-Galactosidase-Acyl-ACP Thioesterase 

30 Fusion Protein 

Sequences which are inserted into pBluescript 
directionally correct and in-frame with the start 
methionine of the interrupted p-galactosidase gene borne 

on the plasmid are capable of being expressed as fusion j 
35 proteins consisting of the N-terminal sixteen amino 
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acids of (J-galactosidase plus those encoded by the 
inserted sequence. Sequencing of p22B revealed that the 
cDNA insert of that plasmid was directionaly correct but 
1 base out of frame. Two Jig of p22B was digested for 2 

5 h with 30 units of Bam HI. This cleaves once in the p 

i 

polylinker site of the Bluescript portion of the plasmid 
and once at a Bam HI site beginning at base 282 of the 
insert in p22. The complete digestion gave two 
fragments, one of 301 bases from the 5 1 end of the cDNA 

10 insert and a portion of the poly linker region of 

Bluescript, and a 4.2 kB fragment composed of Bluescript 

and the 3 1 1320 bases of the cDNA insert. The 4.2 kB 

fragment was purified by electrophoretic separation on a .j_ 

6% polyacrylamide gel run in TRIS/borate/EDTA buffer. L 

15 The fragment was visualized by ethidium bromide U 
staining, cut from the gel, eluted into TRIS/EDTA buffer 
overnight at 37° and precipitated by the addition of 
sodium acetate to 0.3 M and ethanol to 50%. The two 
half-Bam HI sites on the purified fragment were re- 

20 ligated by incubation of 50 ng of the fragment in a 
25 llL reaction with eight units of T4 DNA ligase 
overnight at 16°C. Competent £. c Q l j. XL-1 blue cells 
(Statagene) were transformed with 30 ng of the ligated 
plasmid. Transf ormants were picked as ampicillin- i- 

25 resistant cells after overnight growth. Eight colonies 1 
were chosen and mini-preparations of plasmid DNA were 
made by the alkaline lysis procedure described above. 
Agarose gel electrophoresis of the uncut plasmids next 
to supercoiled weight standards showed that all eight 

30 plasmids were approximately 4.2 kB in size. The eight 
transformed cell lines containing plasmids designated 
p22Ba through p22Bh along with untrans formed XL-1 blue 
cells and the transformed line carrying p22B were grown 
overnight in 5 mL of TB media with 0.2% glucose. The j 

35 overnight cultures were diluted 1:1 into fresh TB + 
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glucose media which also contained 10 nM isopropyl 
thiogalactoside and growth was continued for 1.5 h at 
37°C. Cells were harvested by centrifugation and re- 
suspended in 1 mL of 50 mM IRIS, pH 8.0. A subsample 
5 containing 10 \Lg protein was taken and added to 20 JlL of 
SDS sample buffer for analysis by SDS-PAGE and western 
blotting. The remaining sample was made to 10 mM with 
DTT, 0.2 mM with PMSF and broken by probe sonication. 
Cell debris was removed by centrifugation and 5 JIL of 
10 the extract was used in the standard acyl-ACP 

thioesterase assay using stearoyl-ACP as the substrate. 
The results are shown in Table 3. 

TABLE 3 



15 


Extract. 


Net. reaction 


XL-1 blue 


0.42 




p22B 


0.58 




p22Ba 


2.17 




p22Bb 


2.05 


20 


p22Bc 


2.25 




p22Bd 


2.17 




p22Be 


2.11 




p22Bf 


2.25 




p22Bg 


1.84 


25 


p22Bh 


1.71 



While p22B does not have activity significantly 
greater than the endogenous £. fioJLL activity, 
thioesterase activity was greatly increased by the 
30 combination of removing the transit. peptide and placing 
the construction in-frame relative to the fusion protein 
start methionine. Western analysis of the proteins 
produced by each of the cell lines also showed a single, 
antibody-positive signal of about 42 kD in size produced 
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by each of the in-frame plasmids, but no signal produced 
by plasmid p22B. 

Plasmid p22Ba was chos n for more detailed analysis 
using both palmitoyl and oleoyl-ACP as substrates. 
5 Cells containing p22B were used as the controls \ 
indicative of the endogenous £. eo}i thioest erase. When \ 
palmitoyl-ACP was used as substrate, p22B cell extract 
showed a low but measurable reaction rate while that of 
the p22Ba-containing cells was ten fold higher. When 
10 oleoyl-ACP was used as substrate, the rate of acyl-ACP 
hydrolysis by extract from the p22Ba~containing cells 
was 96 fold greater than that of the controls. 

EXAMPLE 3 \ 

15 USE OF SOYBEAN SEEP ACY1-ACP j 

THIOESTERASE SEQUENCE IN PLASMIP AS A 
RESTRICTION FRAGMENT LENGTH POLYMORPHISM (RFLP) MARKER 
The cDNA insert from plasmid p22B was removed from 
the Bluescript vector by digestion with restriction 

20 enzyme EcoRI in standard conditions as described in 

Sambrook et al. (Molecular Cloning, A Laboratory Manual, 
2nd ed. (1989) Cold Spring Harbor Laboratory Press) and 
labeled with 32 P using a Random Priming Kit from 
Bethesda Research Laboratories under conditions 

25 recommended by the manufacturer. The resulting j 
radioactive probe was used to probe a Southern blot * 
(Sambrook et al., Molecular Cloning, A Laboratory 
Manual, 2nd ed- (1989) Cold Spring Harbor Laboratory 
Press) containing genomic DNA from soybean (Gl ycine max 

30 (cultivar Bonus) and Glycine soi ? (PI81762)), digested 
with one of several restriction enzymes. After 
hybridization and washes under standard conditions 
(Sambrook et al., Molecular Cloning, A Laboratory 
Manual, 2nd ed. (1989) Cold Spring Harbor Laboratory 

35 Press), autoradiograms were obtained and different 5 
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patterns of hybridization (polymorphisms) were 
identified in digests performed with restriction enzymes 
Pst 1 and Eco RI. The same probe was then us d to map 
the polymorphic p22B loci on the soybean genome , 
5 essentially as described by Hel ntjaris t al. (Theor. 
Appl. Genet. (1986) 72:761-769). Plasmid pDSl probe was 
applied, as described above, to Southern blots of EcoRI, 
Pstl f EcoRV, BamHI, or Hindlll digested genomic DNAs 
isolated from 68 F2 progeny plants resulting from a 

10 £. max Bonus x £. so-ia PI81762 cross. The bands on the 
autoradiograms were interpreted as resulting from the 
inheritance of either paternal (Bonus) or maternal 
(PI81762) pattern, or both (a heterozygote) . The 
resulting data were subjected to genetic analysis using 

15 the computer program Mapmaker (Lander et al., Genomics 
(1987) 1: 174-181). In conjunction with previously 
obtained data for 436 anonymous RFLP markers in soybean 
(S. Tingey et al., J. Cell. Biochem., Supplement 14E 
(1990) p. 291, abstract R153), we were able to position 

20 one genetic locus corresponding to the p22B probe on the 
soybean genetic map. This information will be useful in 
. breeding soybean lines with altered saturate levels. 

EXAMPLE 4 

25 USE OF SOYB EAN SEED AHYL-ACP THTOESTERASE SEQUENCE IN 
PT.ASMTD n2?B AS A PROBE FOR ADDITIONAL SOYBEAN ACYL-ACP 

THTOESTERASE GENES 
The cDNA insert in plasmid p22B was removed by 
digestion with EcoRI and purified by electrophoretic 
30 separation on 6% polyacrylamide . The 1.6 kB fragment 
was localized by ethidium bromide staining, eluted from 
the gel and precipitated from 0.3 M sodium acetate with 
50% ethanol. Thirty ng of the resulting DNA fragment 
was used as the template in a random primer labeling 
35 reaction using a labeling kit from Bethesda Research 



53 

Laboratories. The early development soybean seed cDNA 
library described in Example 1 was re-plated at a plaque 
density of 35,000 per plate and duplicate nitrocellulose 
lifts from four plates were scr n d. The pre- 
5 hybridization and hybridization buffer was that t 
described in Example 1, but the probe annealing i 
conditions were 50° for 40 h. The filter lifts were ' 
washed 3 times at room temperature with 0.6 x SSC 
containing 0.1% SDS, then once at 50°C for 5 min in the 

10 same solution. Two additional washes were given for 5 
min each at 50°C in 0.2 x SSC, 0.1% SDS followed by a 1 
min rinse under the same conditions. 

After autoradiography for 20 h, ten hybridizing 
plaques were identified. These were plaque purified and r 

15 excised into Bluescript plasmids as described in Example |I 
1. To check for the similarity of the cDNA inserts in 
these plasmids to the sequence of soybean seed acyl-ACP 
thioesterase copy 1 shown in SEQ ID NO:l, a 30 base 
oligonucleotide was prepared for use as the extension 

20 primer in dideoxy sequencing reactions. The primer 
corresponded to bases 1028 to 1058 in the sequence of 
SEQ ID NO: 13. 

The placement of the primer oligonucleotide on 
cDNA's similar to that found in p22B should allow ^ 

25 sequencing the 3* 100 bases of the open reading frame I 
and 100 to 170 bases of the 3 f untranslated region. \ 
Bluescript plasmids purified from six of the ten 
positively hybridizing clones described above were 
sequenced. Of these, one did not give a sequencing 

30 reaction with the primer. Sequencing from the universal 
and T3 primers of the Bluescript plasmid revealed that 
this clone was a partial cDNA, identical to the insert 
in p22B, but terminating 3 1 to the primer region. The 
sequences of the remaining five clones used as templates i 

35 showed two classes of sequence, one clone identical 
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through the region sequenced to the P 22B and four 
xamples or" a second acyl-ACP thioesterase gene with a 
single base change in the portion of the open reading 
frame which was segu need (at base 1094 of SEQ ID N0:1, 
C is changed to T) and decreased homology in the 3' non- 
coding region. 

Nucleotide 1 of SEQ ID N0:1 is the first nucleotide 
of the EcoRI cut site reading from 5' to 3' on the cDNA 
insert and nucleotide 1602 is the last nucleotide of the 
cDNA insert in the EcoRI cut site of plasmid P 22B which 
encodes copy 1 of the soybean seed acyl-ACP 
thioesterase. Nucleotides 106 to 108 are the putative 
translation initiation codon, nucleotides 271 to 273 are 
the codon for the N-terminal of the purified enzyme, 
15 nucleotides 1207 to 1209 are the termination codon, 

nucleotides 1 to 5 are the 5' untranslated sequence and 
nucleotides 1210 to 1602 are the 3' untranslated 
nucleotides. 

Digestion of two of the plasmids < P 4A and P 4C) with 
EcoRI followed by analysis on agarose gel 
electrophoresis showed cDNA inserts of 1.0 and 1.4 kB 
respectively. Dideoxy sequencing of both plasmids 
showed them to be identical and the insert in P 4C to be 
a full length clone. By the very high degree of 
25 homology between the open reading frames of P 4C and 

P 22B, p4C can reasonably be expected to encode a second 
acyl-ACP thioesterase. The base and amino acid 
sequences of soybean seed acyl-ACP thioesterase (copy 2) 
are shown in SEQ ID NO:3. 
30 . Nucleotide 1 of SEQ ID NO: 3 is the first nucleotide 

of the EcoRI cut site reading from 5 ' to 3' on the cDNA 
insert and nucelotide 1476 is the last nucleotide of the 
cDNA insert in the EcoRI cut site of plasmid p4C which 
encodes copy 2 of the soybean seed acyl-ACP 
35 thioesterase. The putative initiation codon is 
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nucleotides 117 to 119/ the N-terminal of the mature 
protein is nucleotides 282 to 284, and the termination 
codon is nucleotides 1218 to 1220. 

5 EXftMPlE 5 

p 9?R aS A PROBE FOR AHYL-ACP TH T OE S TERASE GENES FROM 

The 32 P~labeled probe produced by random primed 

10 labeling using the EcoRI fragment from p22B as described 
in EXAMPLE 4 was used to screen a genomic library made 
from ^^assic napus cultivar Bridger DNA (Clontech 
commercial library) . The library was plated on two 
plates at a density of approximately 60,000 plaques per 

15 plate and duplicate nitrocellulose lifts were taken for 
hybridization. The prehybridization and hybridization 
buffer was that described in Example 1 with annealing of 
the probe for 55 h at 42°C. The filter lifts were 
washed twice at room temperature with 0.6x SSC 

20 containing 0.1% SDS followed by two 5 min washes and one 
1 min wash in the same solution and all at 52°C. 

Hybridizing plaques were identified by 
autoradiography for 18 h at -70°C. Of three positive 
signals present on the duplicate plates , two were chosen 

25 for plaque purification by removal from the plate, 
dilution and re-screening under the above described 
conditions. Single plaques from the two independent 
clones (designated pCANll and pCAN21) were chosen, cored 
to remove them from the plate, diluted and re-plated at 

30 low titer for amplification. Ten plaques from each of 
the clonal lines were selected, homogenized in buffer 
and used to inoculate a 0.5 mL culture of E. strain 
MN538 at a cell density of 0.5 ODgoo* The inoculum was 
used to start a 100 mL culture in LB media and was grown 

35 to cell lysis. Phage DNA was purified from the culture 
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as described in Sambrook et al. (Molecular Cloning, A 
Laboratory Manual, 2nd ed. (1989) Cold Spring Harbor 
Laboratory Press). DNA from the two clones was digested 
with the following combinations of restriction 
5 endonucleases: Sal I alone, Sal I+EcoRl, Sal I+Xba I, 
Sal I+NotI, and Sal I+Bam HI. The digests were 
subjected to electrophoresis on 1% agarose for blotting 
to Hybond-N (Amersham) . Southern analysis after 
hybridization to the radiolabeled, random-primed probe 

10 from p22B as described above revealed that all 

hybridizing sequence from pCANll resided on a 3 kB Sal 
1/Xba I fragment and that all hybridizing sequence from 
pCAN21 resided on a 6 kB Sal I/EcoRI fragment. These 
two fragments were again generated by digestion from the 

15 corresponding clone, purified from the other fragments 
by electrophoresis on 1% agarose, excised from the gel 
after ethidium bromide staining and removed from the 
agarose by treatment with Gelase (Epicentre 
Technologies), phenol extraction and ethanol 

20 precipitation of the aqueous phase. The fragments were 
ligated into the plasmid Bluescript SK+ (Stratagene) 
which had been double digested with the corresponding 
restriction endonucleases and used to transform 
competent £. coli cells. Both the ligation and 

25 transformation procedures were as described in Example 2 
above. Three positives from pCAN21 and 5 positives from 
pCANll were found and confirmed by purification of 
plasmid DNA and digestion with the endonucleases used to 
generate the ligated inserts . 

30 The shorter, 3 kB clone was chosen for sequencing 

by the dideoxy method as described in Example 1, above 
using the double-stranded Bluescript plasmid as the 
template. The clone was partially sequenced from the 
genomic insert in the M13 universal primer on 

35 pBluescript and two primers made corresponding to 
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segments of p22B. That sequence is shown in SEQ ID 
NO: 20. Sequence alignment with p22B (Deveraux et al. 
(1987) Sequence Analysis Package of the Genetics 
Computer Group, University of Wisconsin Biot chnology 
5 Center) showed a sequence identity of 73.6% after the 
insertion of eight gaps. The sections of alignment span 
1170 bases of the pCANll insert and correspond 
approximately to bases 424 through 1027 in SEQ ID NO:l. 
Five of the eight pCANll sequences which do not align 

10 with p22B appear to be introns, the remaining three gaps 
maybe introns or the combination of intron with coding 
regions which are less homologous with p22B. Assuming 
reasonable intron splicing, the resulting open reading 
frame encodes 168 amino acids of the putative 

15 thioesterase. Of these residues, 132 are identical to 
the soybean seed acyl-ACP thioesterase and fifteen 
residues present in the soybean protein are not 
accounted for in the gene from Brassica , Clone pCANll 
thus encodes a large portion of the Brassica acyl-ACP 

20 thioesterase. 

EXAMPLE g 

P22B AS A PROBE FOR ACYL-ACP THTOESTERASE G ENES FROM 
CUPHEA LANCEOLATA AND CHUPEA VISCOSISSTMA- 

25 

Genomic clones of acyl-ACP thioesterases from 
CttPhea viscosissima and Cuphea lanceolata were obtained 
using a polymerase chain reaction (PCR) strategy using 
initiation primers based segments of the sequence of 

30 p22B. Three segments were chosen from the deduced amino 
acid sequence as amino acid sequences encoded by 
relatively non-degenerate DNA codons. These segments 
were synthesized to include all probable DNA sequences 
encoding the amino acid sequence. The sequences 

35 synthesized and their approximate corresponding bases in 
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SEQ ID NO:l are shown below. Positions at which all 
combinations of multiple bas s were synthesiz d are 
denoted as combinations of bases inside par nthesis. 

5 

TC-58 5«-TA(T/C)AA(G/A)SA(GA)AA(G».)TT(T/C)-3« (SEQ ID N0:14 - 
corresponding to bases 343 through 357 of SEQ ID N0:1) 

TC-59 5 1 -AA (A/G) TGGGT (A/T/G/ C) ATGATGAA (T/C) CAA-3 * (SEQ ID N0:15 - 
10 corresponding to bases 676 through 696 of SEQ ID N0:1) 

TC-60 5 « (C/T) TG (A/G) TTCATCAT (A/T/G/C) ACCCA (T/C) TT-3 * 
(SEQ ID N0:16 - corresponding to the complementary strand of TC- 
59) 

15 

TC-61 3 ' -CT (T/C) CT (C/A) TT (T/C) GT (A/G) CT (T/C) GT (G/A) GT- 
(T/C)GT(C/T)-5' (SEQ ID NO: 17 - corresponding to conplementary 
strand of bases 1125 through 1101 of SEQ ID N0:1) . 

20 Four PCR reactions were run using buffers, 

deoxynudeotides, TAQ polymerase and reaction conditions 
from a GENEAMP kit (Perkin-Elmer/Cetus) , with 200 ng of 
genomic dna from either £. lanceplata or £. yi scpsiss i ma 
as template and either TC-58 and TC-60 or TC-59 and TC- 

25 61 as the sense and antisense primers. The degenerate 
primers were used at a final concentration of 1 mM. The 
temperature cycling reactions were carried out in a 
Perkin-Elmer/Cetus Thermocycler with the temperature at 
the annealing cycle set to 37°C. The extension and 

30 denaturation steps were 72°C and 92 6 C respectively and 
30 cycles were preformed. 

Ho products were formed with the TC-59/TC-61 primer 
set. A product of about 0.7 kB in size was formed with 
the TC-58/TC-60 primer set using genomic DNA from either 

35 species as template. The 0.7 kB fragment from both 
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species was purified from several minor products also 
present in the initial PCR reaction and used as the 
template for re-amplification using the same conditions 
as in the initial reaction. Both products amplifi d and 
5 were again gel purified for blunt-end cloning into Eco | 
RV-cut and phosphatase-treated Bluescript SK« One f 
hundred ng of both fragments were used in a 10 JlL 
ligation reaction at 12°C overnight. One J1L of the 
ligation mix was used to transform 100 )1L of competent 
10 £. coli cells, Transformants were recovered by plating 
on plates containing amphicillin (150 Jlg/mL) to which 
was also added 50 JIL of 5~bromo-4-chloro-3-indolyl-p-D- 

galactopyrannoside (X-gal) (20 Jig/mL) and 10 JiL of lOOmM ; 
IPTG. Six white colonies were recovered from the | 

15 £. lanceolata transformation and seven from the r 
£. vipco^ssima transformation. Plasmid DNA was 
prepared from each of the thirteen cell lines and 
digested with restriction endonucleases to excise the 
cloned insert from the Bluescript plasmid. One insert 

20 of the expected size was obtained from both species and 
double stranded plasmid was prepared from the each of 
the two cell lines for sequencing. 

An 865 base pair insert was sequenced from 
£. viscosissima (SEQ ID NO:22) and an 852 base pair \ 

25 insert was sequenced from £. lanceolata (SEQ ID NO:21) . I 

i 

Sequence alignment (Deveraux et al., (1987) Sequence 
Analysis Package of the Genetics Computer Group, 
University of Wisconson Biotechnology Center) shows that 
the two sequences are 96.6% identical to one another. 
30 Similar alignment of the sequence from £. viscosissima 
with that of p22B (SEQ ID NO:l) shows an overall 
identity of 79.9% with the insertion of thre gaps. The 
gaps appear to be introns and the sequence ends are in 
agreement with the sequences of p22B which were used to 5 
1 35 design the PCR primers. Removal of the introns and 
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translation of the resulting open reading frame giv s 
two amino acid sequences which are 93% identical to the 
sequence derived from the corresponding base sequence of 
p22B. The two clones are thus partial copies of the 
5 genomic £. gjafiaaiafiiM and £. l anceo l ata acyl-ACP 
thioesterases . 

EXAMPLE 7 

SSKSSEBEIiaB ™ W22 B& ™* transformation OF PLANTS FOR 
10 fiT.T™™ F.yPREgsT^w nr ACYL-RCP THTOESTERASE 

, c; cr , ffc pr>rl aniHsen pf gxpression constructions Ufiinq the 
CanSfciSBfciSS SSS promoter. 

The starting vectors for the 35S constructions were 

15 p22B carrying the soybean seed acyl-ACP thioesterase 
gene and pK35K. pK35K was in turn derived from pKNK 
(WO91/09957) . pKNK is a pBR322-based vector which 
contains a neomycin phosphotransferase II (Nptll) 
promoter fragment, a nopaline synthase (NOS) promoter 

20 fragment, the coding region of Nptll and the 

polyadenylation region from the NOS gene. A map of this 
plasmid is shown by Lin et al. (Plant Physiol. (1987) 
84: 8S6-861) . The 320 bp Clal-Bglll fragment in pKNK 
that contains the Nptll promoter was obtained as a 

25 Hindlll-Bglll fragment from the Nptll gene of the 

transposon Tn5 described by Beck et al. (Gene (1982) 19: 
327-336) . The Hindlll site was converted to a Clal site 
by linker addition. The Nptll promoter fragment is 
followed by a 296 bp Sau3A-PstI NOS promoter (NOS/P) 

30 fragment corresponding to nucleotides -263 to +33, with 
respect to the transcription start site, of the NOS gene 
described by Depicker et al. (J. Appl. Genet. (1982) 1: 
561-574). The PstI site at the 3" end was created at 
the translation initiation codon of the NOS gene. The 

35 NOS/P is followed by a 998 bp Hindlll-BamHI sequence 
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containing the Nptll coding r gion obtained from the 
transposon Tn5 (Beck et al., (1982) Gene 19: 327-336 ) 
by the creation of Hindlll and BamHI sites at 
nucleotides 1540 and 2516, respectively. The Nptll 
5 coding region is then followed by a 702 bp BamHI-Clal 
fragment containing the 3 1 end of the nopaline synthase 
gene including nucleotides 848 to 1550 (Depicker et al., 
J. Appl. Genet. (1982) 1: 561-574). The remainder of 
pKNK consists of pBR322 sequences from 29 to 4361. 

10 pKNK was converted to pK35K by removing the Nptll 

and NOS promoters and replacing them with a CaMV 35S 
promoter. The EcoRI -Hindlll 35S promoter fragment is 
the same as that contained in pUC35K (WO91/09957) . The 
35S promoter fragment was prepared as follows, and as 

15 described in Odell et al. (Nature (1985) 313: 810-813) 
except that the 3' end of the fragment includes CaMV 
sequences to +21 with respect to the transcription start 
site. A 1.15 kb Bglll segment of the CaMV genome 
containing the region between -941 and +208 relative to 

20 the 35S transcription start site was cloned in the BamHI 
site of the plasmid pUC13. This plasmid was linearized 
at the Sail site in the polylinker located 3 f to the 
CaMV fragment and the 3' end of the fragment was 
shortened by digestion with nuclease Bal31. Following 

25 the addition of Hindlll linkers, the plasmid DNA was re- 
circularized. From nucleotide sequence analysis of the 
isolated clones, a 3' deletion fragment was selected 
with the Hindlll linker positioned at +21. To create 
pK35K this 35S promoter fragment was isolated as an 

30 , EcoRI-Hindlll fragment, the EcoRI site coming from the 
polylinker of pUCl3, and ligated to pKNK that had been 
digested with EcoRI and Hindlll, the EcoRI site lying 5' 
to the Clal site in pBR322. 

pK35K was digested with BamHI and the cut ends were 

35 blunted using the Klenow fragment of DNA polymerase. 
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Digestion with HinDXII, then r moved the Hptll coding 
r gion leaving P K35K lineariz d with a half HinDIII site 
at th 3* end of the 35S promoter sequence and a blunt 
nd 5' to th NOS 3' r gion. Digestion of p22B with the 
5 combination of HinDIII and EcoRV released a fragment 
which begins twelve bases 5' to the start methionine of 
the soybean seed acyl-ACP thioesterase precursor protein 
and ends in the Bluescript vector just 3« to the 3* non- 
coding region of p22B. Gel purification of both the 

10 p22B-derived fragment and the modified pK35K fragment as 
described in Example 2 followed by ligation of the 
fragments with T4 DNA ligase gave pKTE9 which contains 
the coding sequence for soybean seed acyl-ACP 
thioesterase linked to the 35S promoter in a manner 

15 expected to produce a functional enzyme in an 
appropriate cell. 

To produce an expression vector for production of 
antisense message from p22B, pK35K was digested with the 
combination of BamHI and HinDIII to remove the existing 

20 coding sequence for Nptll and the ends of the remaining, 
linearized plasmid were blunted using the Klenow 
fragment of DNA polymerase. Two XmnI sites exist in 
p22B (at the 5' end coincident with the EcoRI used for 
cloning into pBIuescript and spanning bases 1662 through 

25 1672 at the 3' end of SEQ ID NO:l) so that digestion 
with XmnI removes the entire sequence of p22B including 
the 5' and 3* non-coding regions of the cDNA and leaves 
blunt ends. Gel purification of the desired fragments 
as described above followed by blunt end ligation and 

30 recovery of transformants gave both the sense and 

antisense orientations of p22B 3' to the 35S promoter. 
Orientation of the p22B insert in pK35L was determined 
by restriction mapping using the combination of 
restriction endonucleases EcoRI and. BamHI. The combined 

35 digestion releases a 1101 base pair fragment (950 bases 
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from the pK35K plasmid and 116 bases from the XmnI 
insert from p22B) in the case of sense orientation of 
p22B with r spect to the promoter and a 2365 base pair 
fragment in the case of antisens orientation (950 has s 
5 from pK35K and 1415 bases from the XmnI fragment of 
p22B) . The antisense orientation construction (pKTER) 
is suitable for use in antisense constructs because it 
contains all of the 5 1 and 3* noncoding regions. 

The soybean somatic embryo transformation described 

10 below requires the use of hygromycin as the selectable 
marker for transformation. To introduce this selectable 
marker into the vector , a second plasmid pML18 was 
constructed by the introduction of a DNA segment 
containing the 35S promoter from pK35K 5* to the 

15 hygromycin phosphotransferase gene from £. coli (Gritz 
et al. Gene (1983) 25:179) and 3 1 to the NOS 3' end. 
This segment was ligated into the Sail site of the 
plasmid pGEM92 (Promega) . To introduce the 35S:acyl-ACP 
thioesterase:NOS construction into pML18, pKTE9 was cut 

20 with AatI and Clal and blunted with the Klenow fragment 
of DNA polymerase. AatI cuts pKTE9 just 5 1 to the 35S 
promoter and Clal just 3 1 to the NOS 3' end. Xbal 
linkers were ligated to the blunt ended fragment f the 
fragment was purified by gel electrophoresis and ligated 

25 into the cut Xbal site of pML18. After transformation 
and recovery of clones, plasmid DNA was purified from 
several clones and the construct was restriction-mapped 
to determine the relative orientation of the two 
35S: coding region units. A clone was selected which had 

30 the following orientation: In the poly restriction site 
of pGEM9Z, and oriented 3 f to the fl origin of 
replication; at the Xbal site is the 35S promoter 
followed by the coding region of the acyl-ACP 
thioesterase gene described above, followed by the NOS 

35 3' end. Beginning at a second Xbal site is the second 
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35S promoter followed by the hygromycin 
phosphotransferase gene and the second NOS 3« nd. The 
v ctor was given the name, pKR12 . 

A vector with hygromycin selection and antisense 
5 expression of the acyl-ACP thioesterase message was 
obtained by a similar strategy. To obtain compatible 
ends on the acyl-ACP thioesterase transcription unit in ^ 
pKTER, the plasmid was digested with EcoRI and Clal 
which released the 35S promoter, p22B derived sequence 

10 and NOS 3' end as a unit. The EcoRI and Clal sites in 
the cloning region of pBluescript were cut and the 
purified EcoRI Clal fragment from pKTER was ligated into 
pBluescript. A clone was isolated from transformed j= 
£. eoli cells. This clone was cut at the Xbal site 

15 which is in the cloning region of pBluescript to create ; 
one Xbal end. The Sail site at the other end of the 
insert in pBluescript was digested, blunted and Xbal 
linkers were ligated to it to produce the second end. 
The resulting fragment was purified by gel 

20 electrophoresis and ligated into pML18 which had been 
digested by Xbal as above. A single clone was isolated 
from the transformation. This construction, pKR13, was 
determined by restriction mapping to have the same order 
of the two transcriptions units as described for pKR12. 

25 Vectors for transformation of the acyl-ACP j 

thioesterase gene under control of the 35S promoter into 
plant using ftgxajbaflfcsrifflD tmnefaciens were produced by 
constructing a binary Ti plasmid vector system (Brevan, 
Nucl. Acids Res. (1984) 12 : 8711-8720) . The vector 

30 (pZS199) is based on a vector which contains: (1) the 
chimeric gene nopaline synthase/neomycin 
phosphotrasferase as a slectable marker for transformed 
plant cells (Brevan et al., Nature (1984) 304: 184-186), 
(2) the left and right borders of the T-DNA of the Ti i 

35 plasmid (Brevan et al., Nucl. Acids Res. (1984) 12:8711- 
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B720), (3) the £. colt lacZ a-complem nting s gment 
(Vieria et al., Gene (1982) 19:259-267) with unique 
restriction endonucl ase sites for EcoRI, Kpnl, BamHI, 
HinDII, and Sall f (4) the bacterial replication origin 
5 from the pseudomonas plasmid pVSl (Itoh et al., Plasmid | 
(1984) 11:206-220), and (5) the bacterial neomycin f 
phosphotransferase gene from Tn5 (Berg et al,, Proc. 
Natl. Acad. Sci. U.S.A. (1975) 72:3628-3632) as a 
selectable marker for transformed A- tttfflefaciens- The 
10 nopaline synthase promoter in the plant selectable 
marker was replaced by the 35S promoter by a standard 
restriction endonuclease digestion and ligation 

strategy. The 35S promoter is required for efficient j_ 
firapsica napus transformation as described below. j_ 

15 For construction of the antisense expression H 

vector, pZS199 was digested with EcoRI and Sail. pKTER 
was also digested with EcoRI and Sail to release the 
35S: antisense acyl-ACP synthase :N0S transcriptional unit 
which was isolated by gel electrophoresis. The 

20 EcoRI/Sall fragment was ligated into the cut pZSl99 and 
used to transform £. coli competent cells. Isolation of 
a clone and purification of the plasmid DNA gave the 
binary vector pZKR13. 

The sense 35S construction was assembled by j- 

25 removing the acyl-ACP thioesterase coding region and a 1 
portion of the 3' untranslated region from p22B by ! 
digestion with HinDII and Sspl. Sspl cuts after base 
1351 in SEQ ID N0:1. The flindll site was blunted, the 
fragment isolated by gel electrophoresis r and ligated 

30 into the Hindll/BamHI and blunted version of pK35K 
described above. Clones resulting from the 
transformation of £. siQlL were restriction-mapped by 
cutting with BamHI and EcoRI. The sense-oriented insert 
gives a unique 1101 base fragment which is indicative of j 

35 the sense orientation. The resulting plasmid (pKTElO) 
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was cut at the EcoRI and Sail sites described in pK35K 
above and cloned into pBluescript cut with the same 
restriction endonucleas s to give pBT£4 . 

Cloning into the high copy number plasmid pBTE4 
5 allowed isolation of plasmid DNA which was digested with 
Sail and EcoRI • The resulting fragment containing the 
transcriptional unit 35S:acyl-ACP thioesterase:NOS was 
the ligated into pZS199 which had been similarly 
digested to give the desired sense expression vector 
10 pKR12. 

For cloning the thioesterase sequence into existing 
expression vectors containing seed specific promoters , 
an Ncol site was engineered at the start methionine of 
p22B. For this purpose two PCR primers were 
15 synthesized: 

KR40 5 , -AAAAATCTAGAAGCTTTCGTGCCATGGCTTGGACC-3 t )SEQ ID 
NO: 18) corresponding approximately to bases 83 through 
117 in SEQ ID NQ:1. This created an Xbal site 
20 (substitutions at 89 and 91) and an Ncol site 
(subsitution at 105) . 

KR41 5 f -AGCGTACCGGGATCCGCCTCTA-3 f (SEQ ID NO: 19) 
corresponding approximately to the complementary strand 

25 of bases 274 through 296 in SEQ ID NO:l. 

The polymerase chain reaction run with these two 
primers and p22B as the template amplified a 213 base 
pair fragment which contained the restriction 
endonuclease cleavage sites described in KR40 as well as 

30 an existing BamHI site in p22B (bases 282 through 287 in 
SEQ ID NO:l) . 

Most of the 3' untranslated region of p22B was 
removed by digestion with Sspl and HinCII followed by 
re-ligation of the blunt ends to give pBTE8 • Both the 

35 PCR amplified fragment and pBTE8 were digested with Xbal 
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and BamHI. The remaining, linearized pBTE8 derived 
fragment was purified by gel lectrophoresis and the two 
fr gments were ligat d to give the r striction site 
modifi d acyl-ACP thioesteras pPTEl. 
5 The 5 1 and 3 1 regulatory sequences from the * 

phaseolin gene of Phaseolus vulgaris described by Doyle 
et al. (J. Biol. Chem. (19B6) 261:9228-9238) and 
containing the unique restriction endonuelease sites 
Ncol, Smal, Kpnl and Xbal between the 5 1 and 3 1 . 

10 regulatory sequences were placed into the HinDII site in 
the cloning region of a pUC18 plasmid (BRL) to give the 
plasmid pCW108. 

The Ncol to Kpnl fragment cleaved from pPTEl and 
purified by gel ectrophoresis, was ligated into pCWIOB 

15 after digestion with the same two enzymes to give 
plasmid pPHTEl. Removal of the entire phaseolin 
5':acyl~ACP thioest erase: phaseolin 3 1 transcriptional 
unit by digestion with HinDIII, gel purification of the 
fragment and ligation into HinDII I cut pBluescript gave 

20 pPHTE2. Cleavage of pPKTE2 at the EcoRI and Sail sites 
in the cloning region of the original pBluescript 
plasmid released the desired transcriptional unit with 
the EcoRI and Sail sites required for cloning into the 
Binary vector p2S199 as described above to give pZPHTEl. 

25 The promoter region for the 2S2 albumin protein 

from ftrabadopsis tha liana was obtained as 1250 base 
pairs 5* to the Ncol site which is coincident with the 
start ATG as described by Krebbers et al. (Plant 
Physiol. (1988) 87:859-866) along with the 750 base pair 

30 coding region ahead of a 1000 base pair 3' regulatory 
sequence from the octapine synthase (OCS) gene of 
Acrobacterium (DeGreve et al., J. Mol. Appl. Genet. 
(1982) 1:499-511) all contained in a pUC19 cloning 
vector (BRL). The 2S albumin coding sequence was 

35 removed from the vector by digestion with Ncol and Xbal 
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which cleave at the start ATG and just 3« to the 2S 
albumin stop codon in the OCS 3' regulatory sequence. 
The acyl-ACP thio sterase coding sequence from pTEl was 
removed from the remainder of th plasmid hy digestion 
5 with Ncol and Xbal and purified by gel electrophoresis. 
Ligation of the two fragments gave pSTEl. 

A unique EcoRI site at the 5* end of the 2S2 
promoter sequence and a HinDII site 3' to the OCS 3« 
sequence were digested to release the 2S2: acyl-ACP 
10 thioesterase:OCS transcriptional unit. The fragment was 
purified and ligated into the cut EcoRI and HinDII I 
sites described in pZS199 above to give the binary 

F 

vector pZSTEl. i_ 

t 

15 EXAMPLE 6 

TR pMsroBMATTnT3 nr some ttp soybean EMBRYO 0T7LTURES 

Piil f. nre o f EjBbjaagfiniC Suspensions 

Soybean embryogenic suspension cultures were 
20 maintained in 35 mL liquid media (SB55 or SBP6 described 
below) on a rotary shaker, 150 rpm, at 28°C with mixed 
flourescent and incandescent lights on a 16:8 h 

day/night schedule. Cultures were subcultured every ^ 
four weeks by inoculating approximately 35 mg of tissue j 
25 into 35 mL of liquid medium. j 

Trf»npf nrma1:iori 

Soybean embryogenic suspension cultures were 
transformed by the method of particle gun bombardment 
30 (see Kline et al. Nature (1987) (London) 327:70). A 
Du Pont Biolistic® PDS1000/HE instrument (helium 
retrofit) was used for these transformations. 
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TWA/Particle Preparation 

To 50 \LL of a 60 mg/mL 1 (im gold particle 
suspension was added (in order): 5 \IL DNA(1 |xg/|XL) r 
20 px spermidine (0.1 M) , and 50 JlL CaCl 2 (2.5 M) . The 
5 particle preparation was agitated for three min, spun in 
a microf uge . f or 10 sec and the supernatant removed. The 
DNA-coated particles were then washed once in 400 uL 70% 
ethanol and resuspended in 40 |XL of anhydrous ethanol. 
The DNA/particle suspension was sonicated three tiroes 
10 for 1 sec each. Five JIL of the DNA-coated gold 

particles were then loaded on each macro carrier disk. 



Bombardment 

p 

Approximately 300-400 mg of a four~week-old L 

i_ 

15 suspension culture was placed in an empty 60x15 mm petri t 
dish and the residual liquid removed from the tissue 
with a pipette. For each transformation experiment, 
approximately 5-10 plates of tissue were bombarded. 
Membrane rupture pressure was set at 1000 psi and the 

20 chamber was evacuated to a vacuum of 71 cm mercury. The 
tissue was placed approximately 8.9 cm away from the 
retaining screen and bombarded three times. Following 
bombardment, the tissue was placed back into liquid and 
cultured as described above. i~ 

25 Eleven days after bombardment, the liquid media was 

exchanged with fresh SB55 containing 50 mg/mL ■ 
hygromycin. The selective media was refreshed weekly. 
Seven weeks after bombardment, green, transformed tissue 
was observed growing from u nt ran s formed, necrotic 

30 embryogenic clusters. Isolated green tissue was removed 
and inoculated into individual flasks to generate new, 
clonally propagated, transformed embryogenic suspension 
cultures. Thus, each new line was treated as 

independent transformation event. These suspensions j 
35 could then be maintained as suspensions of embryos 
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clustered in an immature developmental stage through 
subculture or r g nerated into whole plants by 
maturation and g rmination of individual somatic 



Matmatlaa find germination 

Transformed embryogenic clusters were removed from 
liquid culture and placed on a solid agar media (SB103) 
containing no hormones or antibiotics. Embryos were 
10 cultured for eight weeks at 2€°C with mixed flourescent 
and incandescent lights on a 16:8 h day/night schedule. 
During this period, individual embryos were removed f roi 
the clusters and analyzed at various stages of embryo 
development . 



SB55 and SBP6 Stock Solutions (grams per liter) : 
20 sulfate mnx stock 



embryos . 



5 



15 



Media : 



MgS0 4 7H20 
MnS04 H2O 
ZnS04 7H20 
CUSO4 5H 2 0 



37.0 



1.69 



0.86 



0.0025 



25 



W c Ha HH»s 100X Stock 
CaCl2 2H 2 0 k 



44.0 



KI 

C0CI2 6H2O 

KH2PO4 

H3BO3 

Na2Mo04 2H2O 



0.083 
0.00125 



30 



17.0 



0.62 



0.025 



TTV y +»' * M.^t f <if 



MS FeEDTA 3 0nx Stock 
N82EDTA 
FeS0 4 7H 2 0 



2.784 



3.724 



5 B5 Vitamin Stock 

10 g m-inositol 
100 rog nicotinic acid 
100 mg pyridoxine HC1 
1 g thiamine 

10 

SB55 (per L) 

10 mL each MS stocks 



1 mL 2,4-D (lOmg/mL stock) 
€0 g sucrose 
0.667 g asparagine 
pH 5.7 



20 

SBE£ <p*r LJ 0-5 mL 2.4-D in SB55 

SB103 (per L) 
MS Salts 



15 



1 mL B5 Vitamin stock 
0.8 g NH4NO3 
3.033 g KNO3 



25 



6% maltose 
750 mg MgCl2 
0.2% Gelrite 
pH 5.7 



30 
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EXAMPLE S 

AffROBACTERIUM ME nTATPn TRANSFORMATION 

Tnhar.r-ft transf ormation 
5 The binary vectors pKR12, p2STEl f and pPHTE were 

transferred by a freeze/thaw method (Holsters et al., 
Mol Gen Genet (1978) 163:181-187) to the Agrnharterium 
strain LBA4404/pAL4404 (Hockema et al., Nature (1983) 
303:179-180). The Aqmbarterimn trans formants were used 
10 to inoculate tobacco leaf disks (Horsch et al., Science 
(1985) 227:1229-1231). Transgenic plants were 
regenerated in selective media containing kanamycin. 

Brassica nanus transformation 

15 Seeds of cultivar £. napus Westar were surface 

sterilized with a solution of 10% Clorox®, 0.1% SDS and 
placed on germination media consisting of 30 mM CaCl2, 
1.5% agar for 5 to 7 days. 

Three mL cultures of Aomhacteri™ tUITiaf aciens 

20 (strain LBA 4404) containing the desired binary vector 
constructions were grown for 18 to 20 h in Min A media 
at 28°C. To begin the transformation, plates of co- 
cultivation media (BC-1 with 100 \m acetosyringone) were 
poured and allowed to air-dry in a laminar flow hood . 

25 Seedling hypocotyls were cut into 1 cm segments and 
placed into 22.5 mL of bacterial dilution medium (MS 
liquid media with 100 pM acetosyringone) . To the 
solution containing the hypocotyl segments was added 
2.5 mL of the overnight culture of Agrobacterium . After 

30 30 min the hypocotyl segments were removed and placed, 
10 per plate, on the co-cultivation media plates . The 
plates were then incubated at 25°C for three days in dim 
light . 

After three days the segments were transferred to 
35 selective media plates (BC-1 media with 200 mg/L 
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carbenicillin and 50 mg/mL kanamycin) . Callus growth 
occur d at the cut ends of the hypocotyls over the next 
20 days f and after 20 days calli greater than 5 nun in 
diamet r wer transferred to selective regen ration 
5 media (BS-48 containing 200 mg/L carbenicillin) . At the b 
same time, the remaining hypocotyl segments were I 
transfered to fresh selective media and additional calli 
developing over the next 15 days were also transferred 
to the selective regeneration media. All calli produced 

10 were thus transferred to selective regeneration media by 
72 days after the co-cultivation with ftyrobactgriupi . 

Individual calli on .selective regeneration media 
were maintained in continous light at 25°C, and placed I 
on fresh media at two week intervals. If no shoot t 

15 primordia appeared after six weeks on the regeneration ^ 
media, the calli were chopped into 5 mm pieces, re- 
plated on BC-1 media containing 200 mg/L carbenicillin 
for three days, then transferred back to BS-48 media 
with 200 mg/L carbenicillin. Shoots appeared three to 

20 six weeks after calli were transferred to BS-48 media. 

Shoots formed on BS-48 were allowed to elongate 
somewhat before excision and plating on MSVA-1A media. 
Shoots were transferred to fresh MSVA-1A media for a 
second, three-week cycle before transplanting directly j~ 

25 into potting mix. ? 

Media (amounts /L) 

BO IMS minimal organic salts medium (MS salts + 100 mg/L 
30 i-inositol and 0,4 mg/mL thiamine) 
30 g sucrose 
18 g mannitol 
3 mg Kinetin 

3 g DNA grade Agarose j 
35 adjusted to pH 5.8 
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SSH8 

MS minimal Organic Medium (as above) 
B5 vitamin s (1 mL of 1000X stock, described above) 
5 250 mg xylose 

10 g glucose 
0.6 g MES 

4 g DNA grade agarose 

10 adjust to pH 5.7 and add from sterile solutions after 
autoclaving; 2 mg Zeatin and 0.1 mg indole acetic acid 

MSVA-1A 

MS minimal organic salts medium 
15 10 g sucrose 

B5 vitamins (1 mL of 1000X stock, described above) 
6 g DNA grade agarose 
adjust to pH 5.8 

20 

RXftMPLE 10 

ANALYSTS OF TRANSGENIC PLANTS 

Analysis of Somatic Soybean Embryos 
25 While in the globular embryo state in liquid 

culture as described in Example 8, somatic soybean 
embryos contain very low amounts of triacylglycerol or 
storage proteins typical of maturing, zygotic soybean 
embryos. At this developmental stage, the ratio of 
30 total triacylglyceride to total polar lipid 

(phospholipids and glycolipid) was about 1:4, as is 
typical of zygotic soybean embryos at the developmental 
stage from which the somatic embryo culture was 
initiated. At the globular stage as well, the mRNAs for 
35 the prominant seed proteins a* subunit of {}-conglycinin, 
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Kunitz Trypsin Inhibitor III and Soybean Seed Lectin 
were essentially absent. Upon transfer to hormone-fr e, 
solid media to allow differentiation to the maturing 
somatic embryo state as describ d in Example 8, 
5 triacylglycerol became the most abundant lipid class. t 
Similarly, mRNAs for a'-subunit of (5-conglycinin, r 

Kunitz Trypsin Inhibitor III and Soybean Seed Lectin 
became very abundant messages in the total mRNA 
population. In these respects, the somatic soybean 

10 embryo system behaves very similarly to maturing zygotic 
soybean embryos in vivo , and is therefore a good and 
rapid model system for analyzing the phenotypic effects 
of modifying the expression of genes in the fatty acid ^ 
biosynthesis pathway such as acyl-ACP thioest erase and t 

15 for predicting the alteratons expected in zygotic N 
embryos. Similar zygotic embryo culture systems have 
been documented and used in another oilseed crop, 
rapeseed (Taylor et al., Planta (1990) 181:18-26). 

20 Assay for in vitro thioesterase activity from globular 
stage somatic soybean cultures of Example R 

Uniform clumps from eighteen of the twenty-one 
transformed lines obtained in Example 8 along with 3 
non-transformed controls, were placed in tared, 1.5 mL (~ 

25 microfuge tubes and re-weighed to obtain the tissue j 
fresh weight. Two times the tissue weight in an 
extraction buffer consisting of 0.1 M Tricine (pH 8.2), 
0.5 mM EDTA and 1 mM DTT was added and the tissue piece 
was homogenized with a small pestle. The homogenate was 

30 centrifuged to clear and 2 \LL of the supernatant was 
added to an assay mixture consisting of 35 JIL of the 
above Tricine buffer also containing 1 mg BSA/mL and 
1 [ 14 C]-oleoyl-ACP (58 mCi/mmol) . The reaction was 
stopped after 2 min by the addition of 100 \IL of 10% j 

35 acetic acid in 2-propanol. Hydrolyzed, 14 C-oleate was 
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extracted from the mixture by two, 1 mL extractions with 
water-saturated hexane and taken for scintillation 
counting. Extract d prot in was det rmined by the 
Bradford assay (Biorad) using 2 JIL of the xtract. Th 
5 results of these assays are shown in Table 4. 



15 



20 



25 



30 
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TABLE a 

THIOESTERASE SPECIFIC 
CULTURE LINE ACTIVITY 
IDENTIFICATION (nmol-mg protein" 1 min -1 ) 



Control 


0.92 


Control 


1.21 


Control 


1.08 


194-5/4 


0.69 


194-6/5 


0.75 


194-6/1 


0.39 


19.4-3,5,6-1 


0.30 


194-5/2 


0.34 


194-6/4 


0.88 


194-5/1 


0.76 


194-3,5,6-2 


0.78 


194-3,4, 6-3 


0.39 


194-1/2 


0.73 


194-4/1 


0.41 


194-2/2 


1.12 


194-6/2 


1.09 


194-6/3 


0.92 


194-1/4 


0.15 


194-1/1 


0.38 


194-5/3 


0.88 


194-5/5 


0.20 * 



30 
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These results w re unexpected because the acyl-ACP 
thioesterase gene was introduced in such a manner as to 
encode the sense message from the gene and therefore the 
production of additional acyl-ACP thioesterase protein I 
5 and corresponding additional enzymatic activity. In the | 
introduction of a gene into tissue in which that same 
gene or one very highly homologus to it is expressed, [ 
cosuppressive inhibition of both messages is a 
possibility. Another factor which was considered is 
10 that the control tissue in this experiment was not 
transformed and grown on the selective media as 

described in Example 8. It is possible that the ^ 
selective media suppresses thioesterase activity and f= 
that the controls utilized were improper. To test for t 

15 this possiblity tissue clumps from the selected, 

transformed lines were removed from media containing the 
selective agent (hygromycin) and allowed to re-grow in 
liquid culture. Assays were again performed as 
described above. The result was identical: no 

20 transformed lines exibited thioesterase-specific 
activities higher than control cultures. When the 
thioesterase specific activities for transformed lines 
grown on selective media was plotted against the . 
specific activity for the same line grown without j 

25 hygromycin the correlation coefficient was 0.85. It is / j 

logical to conclude that the suppression of acyl-ACP 
thioesterase activity is a function of transformation. 

ynrt.hern a p ulysis of selected lines of transformed 
30 soybean Sfl ffiatifi embrvo cultures 

Nine of the twenty-one somatic embryo lines chosen 

as representative of lines with greatly decreased acyl- 

ACP thioesterase-specific activity, of lines with only ; 

moderately decreased activity, and of lines which do not * 
35 appear to be different from untransformed controls were 
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grown for further study. Total RNA was obtained from 
transformed soybean somatic embryo cultures by the 
Ph nol/SDS Method (Current Protocols in Molecular 
Biology, Ed. F. M. Ausubel et al. f (1991) John Wiley and 
5 Sons, pg. 4,3,1-4,3,3). 

Poly A + mRNA was isolated by oligo dT affinity 
chromatography as described by Aviv et al. (Proc. Nat. 
Acad. Sci. U.S.A. (1972) 69:1408-1412. Two )ig of polyA + " 
mRNA was separated from each transformed soybean culture 

10 line in a denaturing formaldehyde gel for an RNA blot 
analysis as described by Lehrach et al. (Biochemistry 
(1977) 16:4743-4749). Standards containing known amounts 
of pure acyl-ACP thioesterase mRNA were included in the \ 
gel for quantitation of acyl-ACP thiosterase mRNA in the l 

15 transgenic lines. The standard was synthesized in vitro f 
using the method of Krieg et al., Nucl. Acids Res. 
(1984) 12:7057-7070) with p22B as the template DNA. The 
gel-separated mRNA was transfered to Nytran filter and 
hybridized with 32 P-labelled soybean thioesterase RNA 

20 probes as described by Berger et al. (Methods Enzymol. 
(1987) 152:577-582), again using p22B as the template. 
Hybridization was at 68°C in 50% formamide, 0.5 M NaCl,. 
lOx Denhardt f s, 0.2% SDS, 250 Jlg/mL yeast RNA. The 
filter was washed at 68°C once in 2xSSC 30 min and four 1 

25 times in 0.2xSSC at 68°C 30 min each. The filter was 

exposed to X-ray film overnight at -80°C with a Du Pont 
Cronex® intensifying screen. 

The construction of pKR12 (see Example 7) deletes a 
portion of the 5 1 untranslated region of the soybean 

30 seed acyl-ACP thioesterase message. As a result, the 
expected message size for expressed acyl-ACP 
thioesterase transgene is about 200 base pairs smaller 
than the message from expressed endogenous genes. In 
all nine lines a message of about 1.6 kB in size was 1 

35 present in the somatic soybean embryos. In all but line 
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10 



194-6/3 a second message of about 1.4 kB in size was 
also present. After probing with the acyl-ACP 
thioesterase probe, the blots were stripp d of labelling 
by continued washing as above and re-probed with [32 P ]- 
rna prepared as described above but using a Bluescript 
plasmid containing the cDNA for soybean seed Oleosin in 
the insert. The oleosin message is highly expressed in 
the somatic soybean embryos and was used to normalize 
the amount of mRNA loaded from each line. The two lines 
expressing greatly reduced acyl-ACP thioesterase 
activity (194-5/5 and 194-1/4) also had greatly reduced 
levels of both the transgene acyl-ACP thioesterase 
message and the endogenous acyl-ACP thioesterase 
message. Lines 194-1/4 and 194-5/4 had slightly reduced 
15 levels of both messages although it appeared that the 
endogenous message was decreased in relation to the 
transgene message. The level of both messages was 
somewhat lower in line 194-5/4 than in line 194-1/4. 
Line 194-6/3 had only the endogenous message but lines 
20 194-6/4 and 194-6/5 had high levels of both the 

transgene and endogenous gene messages, while all three 
of these lines had acyl-ACP thioesterase activities at 
or near the wildtype level. The single message signal 
in 194-6/3 is explained by the lack of an introduced 
25 acyl-ACP thioesterase gene in this line (see Southern 
analysis below) but the lack of effect of the expressed 
acyl-ACP thioesterase meassage in lines 194-6/4 and 194- 
6/5 is not simply explained. The reduced message levels 
in the remaining lines correlates exactly with reduced 
30 acyl-ACP thioesterase activity and are diagnostic of co- 
supression as seen when highly homologous messages of 
slightly differing size are produced (van der Krol 
et al. f The Plant Cell (1990) 2:291-299). 

The sequence of SEQ ID N0.:1 or any nucleic acid 
35 fragment substantially homologous therewith is therefore 
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shown to be effective in reducing acyl-ACP thioesterase 
activity by cosuppresion when re-introduced into soybean 
and expressed in an appropriate xpresson v ctor. 

5 Southern analysis of genomic DNA from transformed 
somatic soybean embrvos 

Genomic DNA was isolated from maturing somatic 
embryos from the 7 surviving lines described below and 
digested with Xbal as described in Example 3. Southern 
10 analysis was also done as in Example 3 using either the 
acyl-ACP thioesterase coding sequence as the probe 
template or the neomycin phosphotransferase coding 
sequence as the probe template. Using the coding 

sequence of acyl-ACP thioesterase as the probe revealed f~ 
15 that all lines except 194-6/3 contained introduced 
copies of the sequence in addition to the endogenous 
copies. All lines except 194-5/5 had at least one copy 
which was not rearranged from the introduced pKR12 
construction. Line 194-5/5 had multiple inserts, all of 
20 which had undergone some rearrangement. Probing with 
the neomycin phosphotransferase coding sequence showed 
that all transformed lines had at least one copy of the 
selectable marker. Occurrence of copies ranged from one 
in the case of line 194-6/3 to eight in the case of line 
25 194-5/5. 

Analysis Q f fa tty acid profiles and triacvlalvcerol 
syntheses in transformed soybean somatic embrvos 

Seven of the transformed lines from Example 8 were 
30 successfully grown on solid, hormone free, maturation 
media. These lines were used for growth rate analysis, 
analysis of the rate of triacylglycerol synthesis, and 
analysis of the fatty acid profile of the 
triacylglycerol . 
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Following placement on the maturation media and 
subsequent differentiation of the globular embryo 
culture into the maturing embryos , four r plicate 
samples of five embryos each per line wer taken at 
5 intervals. The length of time to differentiation varied t 
with culture line, but embryos produced by each line V 
were of very similar fresh and dry weights at the point 
of differentiation. This point, at which differentiated 
embryos could be easily removed from the remaining 
10 globular culture was designated as "time 0* in the 
course of triacylglycerol synthesis and dry weight 
accumulation. 

Embryos from each line and time point were weighed j= 
for fresh weight, lyophilized and re-weighed for dry t 

15 weight and lipid extraction. An internal standard of r 
tri-heptadecanoyl glycerol was prepared by reacting the 
acid chloride of heptadecanoic acid with glycerol in 
dimethylformamide (DMF) with triethyl amine. The 
triacylglyceride was purified by passage through silica, 

20 crystalized from diethyl ether and used to make a 

0.5 mg-m" 1 standard solution in 2-propanol. Addition of 

100 \IL of the standard solution to the extraction 

solvent for each sample gave an internal standard of 50 

|lg which was co-purified, derivatized and t 

25 chromatographed with the extracted lipid. In addition 
to the internal standard solution , the embryos were 
ground in 0.5 mL of diethyl ether and centrifuged. The 
ether layer was removed and the extraction was repeated. 
The combined extracts were passed through a prepared 

30 silica column (Sep Pak silica cartridge, Millipore) and 
the neutral lipid fraction was eluted with 2 mL of 
diethyl ether. The column eluate was taken to dryness 
under an N2 stream and neutral lipids in the residue 
were transesterified to methanol in 0.5 mL of 1% sodium j 

35 methoxide in methanol. One mL of a saturated NaCl 
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solution was added and the fatty acid methyl esters were 
extract d into diethyl th r. The ether solutions were 
taken to dryness under an N2 stream and the extracted 
methyl sters wer r -disolved in between 50 and 200 JIL 
of hexane (depending on the embryo age) for analysis by 
GLC. GLC seperations were done isothermally at 185° on 
a fused silica capillary column (stationary phase, SP- 
2330 , 30 M in length, Supelco, Belief onte f PA) . Data 
were analyzed by integration relative to the assigned 
weight of the internal standard peak to determine both 
the absolute weight of total fatty acids in 
triacylglycerol and the relative contribution of each of 
the five most prominant fatty acids in soybean 
triacylglycerol . 

The specific activity of acyl-ACP thioesterase was 
also analyzed in the maturing embryos at mid-maturation 
by the method described above. The relative 
contributions of individual fatty acids to the total 
fatty acid profile, total amount of triacylglyceride 
synthesized, and the specific activity of the acyl-ACP 
thioesterase for the seven transformed lines and one 
untransformed control are given in Table 5. 
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TABLE 5 



Total 
Triacyl- 



Thioest ras 
Sp. Activity 





% of Total Fatty 


Acids 




glycerol 




Cell Line 


16:0 


18:0 


18:1 


18:2 


18:3 


[ % dry wt . > 




Control 


14.9 


3.5 


9.9 


56.6 


15.0 


5.7 


1.07 


194-6/3 


16.5 


3.4 


9.8 


50.8 


18.9 


5.8 


1.06 


194-6/4 


13.8 


3.0 


9.1 


56.6 


16.5 


7.6 


0.98 


194-6/5 


11.1 


2.9 


10.7 


57.0 


15.6 


ND 


0.80 


194-2/4 


13.2 


2.7 


10.6 


59.1 


13.6 


6.2 


0.70 


194-5/4 


9.B 


3.0 


11.0 


57.8 


16.8 


6.9 


0.64 


194-1/4 


17.4 


3.7 


7.6 


51.5 


19.6 


3.6 


0.24 


194-5/5 


178.2 


4.6 


4.7 


46.2 


25.8 


2.4 


0.22 



5 The fatty acid profile values in Table 5 are the 

means of four to six determinations. The thioesterase 
specific activities are the means of three assays f two 
done at the globular tissue stage and one at the 
developing embryo stage. 

10 The results show that the nucleotide sequence of 

SEQ ID NO:l is effective in altering seed storage lipid 
biosynthesis. Moderate reduction of the acyl-ACP 
thioesterase activity does reduce the level of saturated 
fatty acid in triacylglycerol (the 16:0 value for line 

15 194-5/4 is signifigantly lower than the control values) . 
Fold reduction of the acyl-ACP thioesterase activity in 
the range of 5 or greater leads to additional effects; 
the total accumulation of triacylglycerol was 
signifigantly decreased and it is likely that the rate 

20 of triacylglycerol synthesis was also decreased. 
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Analysis of tobacco transformed with constitutive and 

seed specific constructions 

Tobacco plants transformed with pKZ12, pKZ13, 
p2STEl f and pPHTEl (see Example 9) were analyzed for 
acyl-ACP thioesterase activity. Those plants tranformed 
with the constitutive constructs pKZ12 and pKZ13 were 
analyzed at the callus level, at the seedling stage just 
after transfer to pots r and in the developing seeds. 
Seven developing plants were obtained from 
transformations with pKR12 and six from transformations 
with pKR13. Of the seven pKR12 transf ormants, two 
showed acyl-ACP thioesterase specific activity that was 
higher than control plants in very young seedlings. One 
of those plants (KR12-4B) maintained measureably higher 
levels of thioesterase activity in developing seeds. 
Tobacco seeds undergo a marked, developmental change in 
seed acyl-ACP-thioesterase activity. Since it is 
difficult to determine seed developmental age with 
accuracy, determining increased thioesterase activity 
relative to controls is also imprecise. Nevertheless, 
it appeared that plant KR12-4B retained about a two-fold 
increased acyl-ACP thioesterase specific activity in the 
seeds. Twenty-two immature seeds from the segregating 
population of seeds on KR12-4B were individually assayed 
for thioesterase activity on a per seed basis. Three 
individuals of the twenty-two had acyl-ACP thioesterase 
activity in the range of 1.2 to 1.7 nmol/10 min/seed. 
Five seeds had activity in the range from 3.3 to 3.9 
nmol/10 min/seed, while the remaining fourteen fell in 
the range between 2 and 3 nmol/10 min/seed. This ratio 
is reasonably near the 1:2:1 ratio that would be 
predicted for the segregating population from a single 
effective transgene insert if each gene dose of the 
transgene gives acyl-ACP thioesterase activity 
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approximately equal to that from the endogenous gene in 
this plant • 

Eleven tobacco plants transformed with p2STEl and 
six tobacco plants transformed with pPHTEl have been | 
5 assayed for acyl-ACP thioesterase activity in developing | 
seeds. Of the p2STEl transformed plants, five did not j 
appear to be different from wildtype in activity, three 
were clearly higher than wildtype and three others were 
not at developmental stages which allowed comparison. 

10 The three transf ormants which had higher activity were 
judged to have 1.9, 2.0 and 2.6 times the acyl-ACP 
thioesterase-specific activity of untransformed controls 
at an equivalent develomental stage. Of the six pPHTEl f= 
transf ormants assayed, two could not be compared t 

15 reliably due to their immature developmental stage, two 1 
were approximately equal to wild-type f and two had 
higher activity. These two lines measured 2.5 and 2.9 
fold higher than, equivalent control seeds. 

Applicants have shown that either constitutive or 

20 seed specific expression of the soybean seed acyl-ACP 

thioesterase gene in a plant may give increased acyl-ACP 

thioesterase activity provided that endogenously 

expressed acyl-ACP thioesterases are not excessively 

homologus to the introduced gene. "j" 

25 § 

Analysis of transformed Brassies napus 

ftrassica napus transformed with pKR13 as described 
in Example 9 was analyzed for acyl-ACP thioesterase 
specific activity at the stage of transformed callus 
30 after re-induction on hormonal media as described in 
Example 9. Calli from twenty-eight individual 
transformants along with four control calli were assayed 
by grinding the callus with a pestle in a 1.5 mL 

microfuge tube after addition of a buffer concentrate ' 
35 consisting of 10 (iL of 0,1 M Tricine, pH 8 and 10 mM 
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DTT. The hompgenate was centrifuged to clear and 5 JIL 
of the supernatant was used in the acyl-ACP thioesterase 
assay as described abov . Th assay value for each 
transformant was compared to the control average and 
5 then placed in classes of 10% intervals to produce 
frequency distribution. See Table 6. 

TABLE 6 

10 CLASS 

r* of control! FREQUENCY 
110-90 . 7 

89-80 3 

79-70 4 
15 69-60 7 

59-50 2 

49-40 1 

39-30 1 

29-20 3 

20 



25 



30 
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(1) GENERAL INFORMATION: 

<±) APPLICANT: Hitz, William D. 
x ' Yadav, Narendra S. 

<ii) TITLE or INVENTION: Nucleotide Sequence* of Soybean Acyl-ACP 
Thioesteraae Genes 

(iii) NUMBER Or SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS: ^ M «nours and Company 

(A) ADDRESSEE: E. I. du Pont de Nemours ana 

(B) STREET: 1007 Market Street 

(C) CITY: Wilmington 

(D) STATE: Delaware 

(E) COUNTRY: U.S.A 
<D ZIP: 19898 

(v) COMPUTER READABLE TORM: 

(A) MEDIUM TYPE: Tloppy disk 

(B) COMPUTER: IBM PC compatible 

C OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOrTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) TILING DATE: 

(C) CLASSiriCATION: 

(vii) PRIOR APPLICATION DATA: M ,„- ogiA 

11 (A) APPLICATION NUMBER: US 07/631,264 
(B) TILING DATE: 20-DEC-1990 

(viii) ATTORNEY/ AGENT INTORMATION: 

(A) NAME: Morrissey f Bruce W. 

(B) REGISTRATION NUMBER: 30 r 663 

(C) REFERENCE /DOCKET NUMBER: CR-8926-A 

fix) TELECOMMUNICATION INFORMATION: 
<A) TELEPHONE : (302) 992-4927 

(B) TELEFAX: (302) 892-7949 

(C) TELEX: 835420 

(2) INFORMATION TOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1602 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Glycin max 

(B) STRAIN: Cultivar Wye 

(D) DEVELOPMENTAL STAGE: Early 8 ed fill 

(E) HAPLOTYPE: Diploid 

(P) TISSUE TYPE: Cotyledon 
(I) ORGANELLE: Nucleus 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA to xnRNA 

(B) CLONE: 22B 

(ix) FEATURE: 

(A) NAME/KEY: aat_peptide 

(B) LOCATION: 271.. 1206 

(C) IDENTIFICATION METHOD: Catalytically active when 
expressed in £. coli 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 106.. 1209 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCTTCTTTCT CATTCTCATA CGCACCCAGT CACCCAGCTT TCCCTTTTTC CTATTTTTTT 60 

TCTCTTTTTT TATTAAAAAA ATAAAAATGT TGAAGCTTTC GTGCA ATG GCT TGG 114 

Met Ala Trp 
-55 



ACC GGG CTC ACT CCC TGG CCC AAT GCG CTT CCG GGC CGG CCC GCC TGC 162 
Thr Gly Leu Thr Pro Trp Pro Asn Ala Leu Pro Gly Arg Pro Ala Cys 
-50 -45 -40 

GCC GTC CCT CGC CGG AGG AGG AGO GGC GTC TCC GGA TTC CGG TTG CCG 210 
Ala Val Pro Arg Arg Arg Arg Ser Gly Val Ser Gly Phe Arg Leu Pro 
-35 -30 -25 

GAA GGC AGG TCG ATC CGG GTG TCC GCG GCG GTG TCG GGA AAG GAC GGC 258 
Glu Gly Arg Ser lie Arg Val Ser Ala Ala Val Ser Ala Lys Asp Gly 
-20 -15 -10 -5 

GCG GTG GCG ACC CGG GTA GAG GCG GAT CCC GGT ACG CTG GCG GAC CGG 306 
Ala Val Ala Thr Arg Val Glu Ala Asp Pro Gly Thr Leu Ala Asp Arg 
1 5 10 

CTG AGG GTG GGG AGO TTG ACG GAG GAT GGG TTG TCT TAT AAG GAG AAG 354 
Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr Lys Glu Lys 
15 20 25 

TTC ATT GTG AGG AGC TAC GAA GTT GGG ATC AAT AAG ACT GCC ACT GTT 402 
Phe lie Val Arg Ser Tyr Glu Val Gly lie Asn Lys Thr Ala Thr Val 
30 35 40 

GAA ACC ATT GCC AAT CTC TTG CAG GAG GTT GGA TGT AAT CAT GCT CAG 450 
Glu Thr lie Ala Asn Leu Leu Gin Glu Val Gly Cys Asn His Ala Gin 
45 50 55 60 



AGT GTT GGA TAT TCT ACT GAT GGT TTT GCA ACC ACC CCT ACG ATG AGA 
Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro Thr Met Arg 
65 * 70 75 



498 



95 



110 
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TTG CGT CTC ATA l » J -- L " 7 «7- n 

Su 2£ Leu II Trp VI Thr Ale Arg Met Hla II 

80 85 

TGG ACT GAC ATT CTT GAG ATA GAG ACA TGG TGC CAA 594 
TGG *° iie Val Glu lie Glu Thr Trp Cys Gin 

JWUI WWW 

_ Leu Lye Asp Tyr 

115 * " " ' " "0 

GGA AGG GCA ACA AGC AAA TGG 
^ Gly Arg Ale Thr Ser Lys Trp 
130 135 

CGA CTC CAG AAG GTT TCT GAT GAT 
Arg lieu Gin Lys Val Ser Asp Asp 
lil 150 

TTC TCT CCT CGA GAG CCC AGG TTA OCT 
eh. evs Pro Ara Glu Pro Arg Leu Ale 
165 

ATA CCA AAA TTG GAA 
He Pro Lys Leu Glu 
180 I 85 

GTG CCA AGA AGA GCG GAT 
_ Val Pro Arg Arg Ale Asp 
195 200 



125 



160 



175 



190 



205 



210 



225 



255 



270 



2B5 



240 245 
GCG ATA CAG GCT GGT GCC GAG GCA GTT CCA 
Ala lie Gin Gly Gly Ala Glu Ala Val Pro 
260 265 

GGA TCT GCC ACG GCA AGG GAA GAC AAA CAT 
Glv Ser Ala Thr Ala Arg Glu Asp Lys His 

275 . 280 

CAT CTA CTT AGG TTG TCT ACT GAA GGA CTT 
His Leu Leu Arg Leu Ser Thr Glu Gly Leu 
290 295 



GAA ATC 
Glu lie 
90 


TAC 
Tyr 


546 


TGG TGC 
Trp Cys 


CAA 
Gin 


594 


AAA GAC 

Lys Asp 


miff 

TAJ. 

Tyr 


642 


pip* mtji/j 
IjlA Alw 

Val Met 


AAVJ 

Met 
140 


690 


MM « « « 

GTT AAA 

val Lys 

155 


(jAA 

Glu 


738 


ATT CCA 
lie Pro 
170 


GAG 
Glu 


786 


GAC CCT 
Asp Pro 


GCT 
Ala 


634 


CTG GAC 
Leu Asp 


ATG 

Met 




CTT GAG 
Leu Glu 


« /■»/*"» 
AGC 

Ser 

220 




All AvU 

lie Thr 
235 


Leu 


978 


TCC CTC 
Ser Leu 
250 


ACT 
Thr 


1026 


GAA CTG 
Glu Leu 


AAA 
Lys 


1074 


GAA GAC 
Glu His 


CAG 
Gin 


1122 


GAG ATA 
Glu He 


AAC 
Asn 

300 


1170 



215 



230 

I- 

3 

i 

\ 



CGG GGA CGA ACA GAA TGG AGA AAG AAA GCT CCA AGA TGAGAACCAT 1216 
Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg 
305 310 
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TATGTGTGCT TCCACCCGAA TCCATGHTTC TGTTTTTGTC TTGTGTTGTT TCATGTTACC 1276 

AGGGTTGTCT TATCAATTTT CCCTTGATAT TTTGCTTAGA GTTTGTGCGC TTAATAGGGX 1336 

TTGAAGAGTT AAAATATTGC TTCTGTTTTC TTGTCATGCT GATCAAAAAT TTAAGTTGTC 1396 

CAAATCCCGT AGTTAGGCTA TATAGGTTGA CATCAATCTC TGATCCATTA GTATCAGATT 1456 

CCATGAATCT CATTCTACCT TAAGGGAGCA TAGAAATCCA GGAAGTTGGT ATGGATCTGC 1516 

CATCTACTGC ATGACTTGAA CAATGTGTGT TAAAATAATC ATTTTGAAAT AATTCAATTA 1576 
GCTAATTATT AATGTTCTTA AAAAAA 

(2) INFORMATION TOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

Met Ala Trp Thr Gly Leu Thr Pro Trp Pro Asn Ala Leu Pro Gly Arg 
-55 -50 -45 -40 

Pro Ala Cys Ala Val Pro Arg Arg Arg Arg Ser Gly Val Ser Gly Phe 
-35 -30 -25 

Arg Leu Pro Glu Gly Arg Ser He Arg Val Ser Ala Ala Val Ser Ala 
* -20 -15 -10 

Lys Asp Gly Ala Val Ala Thr Arg Val Glu Ala Asp Pro Gly Thr Leu 
-5 1 5 

Ala Asp Arg Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr 

10 15 20 25 

Lys Glu Lys Phe He Val Arg Ser Tyr Glu Val Gly He Asn Lys Thr 
* 30 35 40 

Ala Thr Val Glu Thr He Ala Asn Leu Leu Gin Glu Val Gly Cys Asn 
45 50 55 

His Ala Gin Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro 
60 e 65 70 

Thr Met Arg Lys Leu Arg Leu He Trp Val Thr Ala Arg Met His He 
75 80 85 

Glu He Tyr Lys Tyr Pro Ala Trp Ser Asp He Val Glu He Glu Thr 
90 95 100 105 

Trp Cys Gin Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe He Leu 
r 110 115 120 

Lys Asp Tyr Ala Thr Asp Glu Val He Gly Arg Ala Thr Ser Lys Trp 
125 130 135 
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Val Met Met Asn Gin Asp Thr Arg Arg Leu Gin Lys Val Ser Asp A*> 
140 14D 

V.1 Lys Glu Glu Tyr Leu Val Ph Cy. Pro Arg Glu Pro Arg Leu Ale 

155 * 
II. P xo Glu »U Up S.r U» «r L.U Ly, ty. II. Pro «*• «« 

170 175 

„p Pro Ala Gin Tyr St «g »» Gly «» VI Pro Ai, U, Jlj Up 

Leu Asp Mat ».n Gin Hi. VI U» Un v.1 Thr Tyr II. Gly Trp v.1 

205 210 

ti» Tie Abo Ser His Glu Leu Gin Ser 
Leu Glu Ser Met Pro Gin Glu lie lie asp wr ~. 

220 225 
lie Thr Leu Asp Tyr Arg Arg Glu Cys Gly Gin His A» P He Val Aep 



235 



240 



Ser Leu Thr Ser Val Glu Ala He Gin Gly Gly Ale Glu Ala Val Pro 

255 



250 



Glu Leu Lys Gly Thr Asn Gly Ser Ala Thr Ale Arg Glu Asp Lys Hi. 



270 



Glu His Gin Gin Phe Leu His Leu Leu Arg Leu Ser Thr Glu Gly Leu 
285 

Glu lie Asn Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg 

300 305 

(2) INFORMATION TOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear j_ 

* 

3 

(ii) MOLECULE TYPE: cDNA to xnRNA ] 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 
(By STRAIN: Cultivar Wye 

(D) DEVELOPMENTAL STAGE: Early seed fill 

(E) HAPLOTYPE: Diploid 

<F) TISSUE TYPE: Cotyledon 
(I) ORGANELLE: Nucleus 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA to mRNA 

(B) CLONE: 4C i 
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(ix) rzhi 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 282.. 1217 

(iz) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 117.. 1220 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCTTCAAAAC CACTTGTTTC TTCAGTTCCA CTCTGCTTCT TCCCCTTTCT CTTCTCATAC 60 

TCACCCA6CT TTCCTTTTTA TTAAAAAACA AAAAAAAATG TTGAAGCTTT CGTGCA 116 

ATG GCT TGG ACC GGG CTC ATA TGC TGG CCC AAT GCG TTT GCG GGC CGG 164 
Met Ala Trp Thr Gly Leu He Cya Trp Pro Asn Ala Phe Ala Gly Arg 
-55 -50 ^ -45 -40 

GGC CGC TGC GCT CGT CCC AGC CGG AGG ATA AGC GGC ATC TCC GGA TTC 212 
Gly Arg Cys Ala Arg Pro Ser Arg Arg He Ser Gly He Ser Gly Phe 
-35 -30 -25 

TGG TCC CCG GAA GGA GGG CGG ATC CGG GTG TCG GCG GTG GTG TCG GCG 260 
Trp Ser Pro Glu Gly Gly Arg He Arg Val Ser Ala Val Val Ser Ala 
-20 -15 -10 

AAG GAT GGC GCG GTG GCG ACC CGG GTG GAG GCG GAG TCC GGG ACG CTG 308 
Lys Asp Gly Ala Val Ala Thr Arg Val Glu Ala Glu Ser Gly Thr Leu 
-5 15 

GCG GAC CGG CTG AGG GTG GGG AGC TTG ACG GAG GAT GGG TTG TCT TAC 356 
Ala Asp Arg Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr 
10 " 15 20 25 

AAG GAG AAG TTC ATT GTG AGG AGC TAC GAA GTT GGG ATC AAT AAG ACT 404 
Lys Glu Lys Phe He Val Arg Ser Tyr Glu Val Gly He Asn Lys Thr 
30 35 40 

GCC ACT GTT GAA ACC ATT GCT AAT CTC TTG CAG GAG GTT GGA TGT AAT 452 
Ala Thr Val Glu Thr He Ala Asn Leu Leu Gin Glu Val Gly Cys Asn 
45 50 55 

CAT GCT CAG AGT GTT GGA TAT TCT ACT GAT GGT TTT GCA ACC ACC CCT 500 
His Ala Gin Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro 
60 65 70 

ACG ATG AGA AAA TTG CGT CTC ATA TGG GTT ACT GCT CGC ATG CAC ATT 548 
Thr Met Arg Lys Leu Arg Leu He Trp Val Thr Ala Arg Met His He 
75 80 85 

GAA ATC TAC AAA TAC CCT GCT TGG AGT GAC GTT GTT GAG ATA GAG ACA 596 
Glu He Tyr Lys Tyr Pro Ala Trp Ser Asp Val Val Glu He Glu Thr 
90 ^ 95 100 105 

TGG TGC GAA GGT GAA GGA AGG GTT GGG ACA AGG CGT GAT TTT ATA CTG 644 
Trp Cys Gin Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe He Leu 
110 115 120 
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... ~ T GCA ACT GAT GCA GTC ATX GGA AGG GCA ACA AGC AAA TGG 
^ Sp £r Ala £ Sp Ala Val lie Gly Arg Ala Thr Ser Iff. *rp 

GTA ATG ATG AAT CAG GAC ACC AGA CGA CTC CAG AAA GTT TCT GAT GAT 
35 St ft Asn Gin Asp Thr Arg Arg Leu Gin Lys Val Ser Asp Asp 
X40 145 

GTT AAA GAA GAG TAT TTG GTT TTC TGT CCT CGA GAG OCC AGG TTA GCA 788 
Lys Su Glu Tyr Leu Val Phe Cys Pro Arg On Pro Arg Leu Ala 
155 160 1€5 

ATT CCA GAG GCA GAT AGC AAT AAC TTG AAG AAA ATA CCG AAA TTG GAA 836 
S Pro Glu Ala Asp Ser Asn Asn Leu Lys Lys He Pro Lys Leu Glu 
170 175 180 185 

SAC CCT GCC CAG TAT TCC AGA CTT GGA CTT GTG CCA AGA AGA GCG GAT 884 
Asp Pro Ala Gin Tyr Ser Arg Leu Gly Leu Val Pro Arg Arg Ala Asp 
190 195 *uu 

CTG GAC ATG AAT CAG CAT GTT AAC AAT GTC ACC TAT ATT GGA TGG GTG 932 
Su Sp Met AsJ Gin His Val Asn Asn Val Thr Tyr lie Gly Trp Val 
205 210 215 

CTT GAG AGC ATG CCT CAA GAA ATC ATT GAT AGT CAT GAG TTG CAG ACT 980 
Su Glu Ser Met Pro Gin Glu He He Asp Ser His Glu Leu Gin Ser 
220 225 230 

ATT ACC TTG GAT TAC AGA CGA GAG TGC GGA CAG CAT GAC ATA GTT GAT 1028 
£e Thr Su Asp Tyr Arg Arg Glu Cys Gly Gin His Asp He Val Asp 
235 2 *0 245 

TCC CTC ACT ACT CTG GAA GAA ATC CAG GGT GCT GCC GAG GCA CTT TCA 1076 
IS £er VaT Glu Glu He Gin Gly Gly Ala Glu Ala Val Ser 

250 2SS 2S0 2 «5 

GAA CTG AAA ACT ACA AAT GGA TCT GCC ATG GCA AGG GAA GAC AAA CAT 1124 
Glu Leu Lys Ser Thr Asn Gly Ser Ala Met Ala Arg Glu Asp Lys His 
270 275 280 

GAA CAC CAG CAG TTT CTG CAT CTA CTT AGG TTG TCT ACT GAA GGA CTT 1172 
Glu His Gin Gin Phe Leu His Leu Leu Arg Leu Ser Thr Glu Gly Leu 
285 290 295 

GAG ATA AAC CGG GGA CGA ACG GAA TGG AGA AAG AAA GCT CCA AGA TGAGAACCAT 
1227 

Glu He Asn Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg 
300 305 310 

TACCTCTGCT TCCACCCAAA TCCATGATTC TCTTTTTGTC TTTCTTCTGT TGTTTCACGT 1287 

TACCAGGGTT ATGAACTTAT CAATTTTCCC TTTATATTTT GCTTAGAGTT TGTGGACCCT 1347 

TAATAGGGGA TTGGAGGAGT TAAAATTTTG TCGCTCTTTT CTTCTCATGC TCACAAATTT 1407 

AAATTGTTGG AATTCATCAT CAAGCTTATC GATACCCTCG ACCTCGAGGS GGGGCCCGCT 1467 
ACCCAATTC 



1476 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Trp Thr Gly Leu lie Cys Trp Pro Asn Ala Phe Ala Gly Arg 
-55 -50 -45 -40 

Gly Arg Cya Ala Arg Pro Ser Arg Arg He Ser Gly He Ser Gly Phe 
-35 -30 -25 

Trp Ser Pro Glu Gly Gly Arg He Arg Val Ser Ala Val Val Ser Ala 
-20 -15 -10 

Lys Asp Gly Ala Val Ala Thr Arg Val Glu Ala Glu Ser Gly Thr Leu 
J -5 1 5 

Ala Asp Arg Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr 

10 15 20 25 

Lys Glu Lys Phe He Val Arg Ser Tyr Glu Val Gly He Asn Lys Thr 
30 35 40 

Ala Thr Val Glu Thr He Ala Asn Leu Leu Gin Glu Val Gly Cya Asn 
45 50 55 

His Ala Gin Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro 
60 €5 70 

Thr Met Arg Lys Leu Arg Leu He Trp Val Thr Ala Arg Met His He 
75 80 85 

Glu He Tyr Lys Tyr Pro Ala Trp Ser Asp Val Val Glu He Glu Thr 
90 95 100 105 

Trp Cys Gin Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe He Leu 
110 115 120 

Lys Asp Tyr Ala Ser Asp Ala Val He Gly Arg Ala Thr Ser Lys Trp 
125 130 135 

Val Met Met Asn Gin Asp Thr Arg Arg Leu Gin Lys Val Ser Asp Asp 
140 145 150 

Val Lys Glu Glu Tyr Leu Val Phe Cys Pro Arg Glu Pro Arg Leu Ala 
155 160 165 

He Pro Glu Ala Asp Ser Asn Asn Leu Lys Lys He Pro Lys Leu Glu 
170 175 180 185 

Asp Pro Ala Gin Tyr Ser Arg Leu Gly Leu Val Pro Arg Arg Ala Asp 
190 195 200 

Leu Asp Met Asn Gin His Val Asn Asn Val Thr Tyr He Gly Trp Val 
205 210 215 
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Leu Glu Ser Met Pro Gin Glu Il g lie Asp S r Hie Glu Leu Gin 8 r 

220 

He Thr Leu Asp Tyr Arg Arg- Glu Cye Gly Gin His Asp He Val Asp 

235 240 
Ser Leu Thr Ser V.1 Glu Glu He Gin Gly Gly Ala Glu Ala Val Ser 
250 255 

Clu Leu Lys Ser Thr Asn Gly Ser Ala Met Ala Arg Glu Asp Lys Hie 

Glu HiB Gin Gin Phe Leu His Leu Leu Arg Leu Ser Thr Glu Gly Leu 
285 290 

Glu lie Asn Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg 

300 305 * 



(2) INFORMATION FOR SEQ IB NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TIPS: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Ara Val Glu Ala Pro Gly Gly Thr Leu Ala Asp Arg Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO; 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:€: 

lie Glu lie Tyr Lys Tyr Pro Ala Trp Leu Asp He Val Glu lie 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO:7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Val Glu Ala Pro Gly Gly Thr Leu Ala Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID HO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTKGARGCNC CHGGWGGNAC NYTKGCAKA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 15 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/mod_base- i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 9 ■ 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/mod_base- i 

(ix) FEATURE: j ^ ^ 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 18 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/modjbase- i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 21 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/mo debase- i 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTKGARGCNC CWGGNGGNAC NYTKGCAKA 
(2) INFORMATION TOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 14 amino acids 
(B) TYPE: amino acid 
<D) TOPOLOGI: linear 

(ii> MOLECULE TOPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

He Glu He Tyr Lys Tyr Pro Ala Trp Leu Asp lie Glu lie 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
ATNGARATNT AYAARTAYCC NKCNTGGYTN GAYATNGARA TN 42 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: , ^ . 

(A) NAME/KEY: modi£ied_base 

(B) LOCATION: 3 J _ , 
fC> IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 
/mod_base« i 

(ix) FEATURE: 

(A) NAME/KEY: modifiedjaase 

(B) LOCATION: S 

fCV IDENTIFICATION METHOD: experimental 
(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 
/mod base- i 
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(ix) FEATURE: 

(A) NAME/KEY: modifi d base 
<B) LOCATION: 21 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION : /evidence- EXPERIMENTAL 

/mod_baae- i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 24 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/mod_base- i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 30 

(C) IDENTIFICATION METHOD: experimental 

(V) OTHER INFORMATION : /evidence- EXPERIMENTAL 
/mod^base- i 

(ix) FEATURE: 

(A) NAME /KEY: modified base 

(B) LOCATION: 36 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence- EXPERIMENTAL 

/xaod_base- i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATNGARATNT ATAARZATCC NGCNTGGTTN GATATNGARA T 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGTGGAAGC GATACAGGGT GGTGCCGAGG C 31 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

15 

TAYAARGARA ARTTY 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

21 

AARTGGGTNA TGATGAAYCA A 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

21 

YTGRTTCATC ATNACCCAYT T 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

24 

YTGYTGRTGY TCRXGYTTMT CYTC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AAAAATCTAG AAGCTTTCGT GCCATGGCTT GGACC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGCGTACCGG GATCCGCCTC TA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



ATGACCATGA 


TTACGCCAAG 


CTCGAAATTA 


ACCCTCACTA 


AAGGGAACAA AAGCTGGAGC 


€0 


TCCACCGCGG 


TGGCGGCCGC 


TCTAGAACTA 


GTGGATCCCC 


CGGGCTGCAG GAATTCGGCA 


120 


CGAGAAGAAC 


TTTGTTGTTC 


GTTTGATGTA 


GGTTAGGAGG 


TGGGATGTAA TCAGTTTCAG 


180 


AGCGTTGTAT 


TTTTGACTGA 


TGGGTTTGCG 


ACAACACCTA 


CCATGAGGAA ACTGAATCTC 


240 


ATTTGGGTCA 


CTTCGAGAAT 


GCACATTGAG 


ATCTACAGAT 


ATCCAGCTTG GTATTGTTTT 


300 


TTTTTTTTCT 


TTTTGGCTGC 


GTATGTTTTG 


ATGACAACAA 


ATGAGTTGAA TTCTTAAAAA 


360 


TTTTGGTTAC 


AGGGGTGATG 


TGGTTGTCAG 


AGTGAAGAAG 


GATAGCGACA AGGCGTGACT 


420 


GGATTCTTAA 


GGACATTGCT 


AACCGGCGAA 


TTCACTGGCC 


GCAGTACTAG GTTTCCTTCT 


480 


CATCATTGTT 


TGCTTTCTCC 


ATTGGTTTGT 


GCAATGGAAT 


AAAATTTTCT TATGTTAAAG 


540 


ATATAAGTTT 


CTGTCACTTG 


GGTTTATGGG 


ACTGTCCTGA 


TTAGTTGTAC CTATGTGTTA 


600 


CCGTTTCAGC 


AAGTAGGTGA 


TGATGAACCA 


AGACACAAGA 


CGGCTACAGA AAGTTTCTGA 


660 


TGATGTTCGG 


GACGAGCACT 


TGATGTTTTG 


TCCTAAAGAA 


CCCAGGTAAA AGAACTTTGT 


720 


GCCAATGCAA 


TGTTTGCTGG 


TCAATCATAT 


CGTTATATTC 


ATGAATTGCC AACTATTCTG 


780 
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TTTATTGTAT ATCTTTGTAG ATTAGCATAT CCTGAGGAGG AAAATACCAG AAGCTTGAAG 840 

AATATCCCCA AACTCGAAGA TCTGGCCAAG TACTCAATCA TTGGACTTAA GGTATAAAAT 900 

AGAACAATAA GATTCTTTGT AAGAATCAAC ATTCCTAAAG GACTTTATAA TCATGTTTCT 960 

TTGCAGCCAA GAGCGAGCTG ATCTCGGCAT GAACCATCAT GTCAATAATG TCACATATAT 1020 

TGGATGGCTT CTTGAGGTTA GTGTCATCAT CAGCTTCAGT AATAATCATA TGAGCATACC 1080 

TCAAGAGTTA TAGACACGCA CGAACTTCAG GTCATAACTT TGGATTACAG ACGAGAATTT 1140 

AGCAAGACGA TGTGGTGGAT TCATTGACCA CCTCAAAGAA TGGCTCTGCA ACATCAGGCA 1200 

CACAAAGCCA CAACAATACC CAGTTCTTAC ATCTCCTAAG GTTGTAGGTT GAAAGAACTA 1260 

TGAAGTGGTG AGCTGCAGAT CTTTGCATGT GCAGAGGGTT GTAGGTGGGG GCCTTAGCAG 1320 

GGAGGTGTAC GTTGTGTCAT TGAATAAC7C GAGGGGGGGC CCGGTACCCA ATTCGCCC 1376 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cuphea lanceolata 

(3d.) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CCCCCGGGCT GCAGGAATTC GATTACAAGG AGAAATTCAT TGTAAGATGC TACGAGGTCG 60 

GTATTAACAA GACAGCCACC GTCGAGACCA TGGCAAATCT TTTGCAGGTC TCTTTCTTGC 120 

ATGCATGCAT CGTCAGGTTT CTGGGCATTG GTGATTTGCT TGTATTAATT TACATGTCAA 180 

ATTTAATATT TCCTTGTCTC CGACATGCAA CACCATTTTT TTTTCTTTAA ATGTTCACTT 240 

TGGATACAGG AAGTAGGTTG TAACCATGCT CAGAGTATTG GATTCTCAAC CGATG GTT T T 300 

GCGACGACCA CTACCATGAG AAAATTGAAT CTGATATGGG TTACTCGTCG AATGCACATA 360 

GAAATTTACA AGTACCCAGC ATGGTTAGTT AGTTCTTTCC ACTCTCTTTC TTCATCTCCC 420 

CAGCCACCCC ACTGCTAACT TTTTGATTGA CAATTGTTGA TACGTACTCT AGGGGTGATG 480 

TGGTTGAAAT TGAGACTTGG TGCCAAAGTG AAGGAAGAAT TGGAACAAGA AGGGATTGGA 540 

TTCTCAAGGA CTATGCTAAT GGTGATGTTA TTGGAAGAGC CACAAGGTAG ACAGACTGCT 600 

CTCTCATATA TACAGCAGTG AGAGAACAAA AGAATAATAT TGGAACAATA TCAAATCGAA 660 

TCTAAACAAT TGGAAGACAT TATTTTGAGG AAAGGGAAGA TTGAAACTGA TGTTCTTAGT 720 

AATCTATACG TGCACGGCGC CATGATTATC CATTTCATGA GAATTGTTCC AATCATTTAT 780 
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ATTAATCTGT TTTCAGCAAG TGGGTCATGA TGAATCAAAT CAAGCTTATC GATACCGTCG 840 
ACCTCGAGGG GG 852 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cuphea viscosissima 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

CCCCCTCGAG GTCGACGGTA TCGATAAGCT TGATTATAAG GAGAAGTTTA TTGTCAGATG 60 

CTACGAGGTC GGTATTAACA AGACAGCCAC CGTCGAGACC ATGGCAAATC TTTTGCAGGT 120 

CAGGTTCTCT CTGTTTCCAT ATCGTTGCAT GCATGCATCG GTTTCTGGGC ATTGGTTATT 180 

TGCTTGTATT AATTTACATG TCAAAATTTA ATTTAATATT TCCTTGTCTC CGACATGCAA 240 

CACCATTTTT TTTTTTAAAT GTTCACTTTC AATGCAGGAA GTAGGTTGTA ACCATGCTCA 300 

GAGTCTTGGA TTCTCAACCG ATGGTTTTGC GACGACCACT ACCATGAGGA AATTGAATCT 360 

GATATGGGTT ACTGCTCGAA TGCACATAGA AATTTACAAG TACCCAGCAT GGTTAGTTAG 420 

TTCTTCCACT CTCTTTTCTT CATCTCCCCA GCCACCCCAC TGCACTTTTT GATTGACAAT 480 

TGTTGGATAC GTCTCTAGGG GTGATGTGGT TGAAATTGAG ACTTGGTGCC AAAGTGAAGG 540 

AAGAATCGGA ACAAGAAGGG ATTGGATTCT CAAGGACTAT GCTAATGGTG AAGTTATTGG 600 

AAGAGCCACA AGGTAGACAG ACTGCTCTCA TATATACATC AGTGAGATAA CAAAGGGAAT 660 

AATATTGGAA CAATATCAAA TCGAATCTAA ACAATTGGAA GACATTATTT TGAGCAAGTG 720 

AAGATTGAAA CTGATGTTCT TAGTAATCTA TACGTGCACG GCGCCATGAT TATCCATTTC 780 

ATGAGAATTG TTCCAATCAT TTATATTAAT CTGTTTTCAG CAAATGGGTG ATGATGAACC 840 

AAATCGAATT CCTGCAGCCC GGGGG 865 
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CLAIMS 

What is claimed is: 

5 1. An isolated nucleic acid fragment comprising a S 

nucleotide sequence encoding a plant acyl-ACP * 
thioesterase . ; 

2. An isolated nucleic acid fragment of Claim 1 
10 wherein said fragment is isolated from a plant selected 
from the group consisting of soybean, oil producing 
Brassica species / f:»phea gj Bfsosissima and Cuphea 

z 

lanreolata. 

15 3. An isolated nucleic acid fragment comprising a 

nucleotide sequence encoding the soybean seed acyl-ACP 
thioesterase cDNA corresponding to the nucleotides 1 to 
1602 of SEQ ID NO: l f or any nucleic acid fragment 
substantially homologous therewith. 

20 

4 . An isolated nucleic acid fragment comprising a 
nucleotide sequence encoding the soybean seed acyl-ACP 
thioesterase cDNA corresponding to the nucleotides 1 to 

1476 of SEQ ID N0:3, or any nucleic acid fragment j 
25 substantially homologous therewith. \ 

5. An isolated nucleic acid fragment of Claim 3 
wherein said nucleotide sequence encodes the soybean 
seed acyl-ACP thioesterase precursor corresponding to 

30 nucleotides 106 to 1206 of SEQ ID N0:1, or any nucleic 
acid fragment substantially homologous therewith. 

6. An isolated nucleic acid fragment of Claim 4 

wherein said nucleotide sequence encodes the soybean ' 
35 seed acyl-ACP thioesterase precursor corresponding to 
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nucleotides 117 to 1217 of SEQ ID NO: 3, or any nucleic 
acid fragment substantially homologous th rewith. 

7, An isolat d nucleic acid fragment of Claim 5 
wherein the said nucleotide sequence encodes the mature 
soybean seed acyl-ACP thioesterase enzyme corresponding 
to nucleotides 271 to 1206 of SEQ ID N0:1, or any 
nucleic acid fragment substantially homologous 
therewith . 



8. An isolated nucleic acid fragment of Claim 6 
wherein the said nucleotide sequence encodes the mature 
soybean seed acyl-ACP thioesterase enzyme corresponding 
to nucleotides 282 to 1217 of SEQ ID NO: 3, or any 

15 nucleic acid fragment substantially homologous 
therewith . 

9. A chimeric gene causing altered levels of 
acyl-ACP thioesterase activity in a transformed plant 

20 cell, the gene comprising a nucleic acid fragment of 
Claim 1 operably linked to suitable regulatory 
sequences. 

10. A chimeric gene causing altered levels of 
25 mature seed acyl-ACP thioesterase enzyme in a 

transformed microorganism, the gene comprising a nucleic 
acid fragment of Claim 7 operably linked to suitable 
regulatory sequences . 

30 11. A plant transformed with the chimeric gene of 

Claim 9. 

12. Oil obtained from the plants containing the 
chimeric genes of Claim 9. 

35 
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13. A method of producing soybean seed oil 
containing altered 1 vels of palmitic and st aric acids 
comprising: 

(a) transforming a plant cell of an oil-producing j_ 
5 species with a chimeric gene of Claim 9, p 

(b) growing fertile soybean plants from the ? 
transformed plant cells of step (a) , 

(c) screening progeny seeds from the fertile 
soybean plants of step (b) for the desired levels of 

10 palmitic and stearic acids, and 

(d) processing the progeny seed of step (c) to 
obtain oil containing altered levels of palmitic and 

stearic acids. * 

15 14. A method of Claim 13 wherein the plant cell of 

an oil-producing species is selected from the group 
consisting of soybean, oil seed Brassica species, 
sunflower, cotton, cocoa, peanut, saf flower, and corn. 

20 15, A method of producing mature soybean seed 

acyl-ACP thioesterase enzyme in microorganisms 
comprising: 

(a) transforming a microorganism with a chimeric ' 
gene of Claim 10, and j 
25 (b) growing the transformed microorganism of step \ 

(a) to produce mature soybean seed acyl-ACP thioesterase 
enzyme. 

16. A method of RFLP breeding to obtain altered 
30 levels of palmitic and stearic acids in soybean seed oil 
comprising: 

(a) making a cross between two soybean varieties 
differing in the trait, 
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(b) making a Southern blot of restriction enzyme 
digested genomic DNA isolated from several progeny 
plants resulting from the cross of step (a); and 

(c) hybridizing the Southern blot with the 
radiolabeled nucleic acid fragment of Claim 3 or 4. 



